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Preface to the English Edition 


The English version of this originally Russian book has been substantially supplemented by new material, 
and is almost 50 percent larger than the original. Part of the new material has been prepared in cooperation 
with colleagues. We express our deep gratitude to all of them: A. Klimov, A. Kostochka, I. Kan, I. Rival, V. 
Shmatkov, K. Rybnikov, A. Malykh, S. Sal'nikov, N. Sauer, A. Sidorenko, J. McIntosh, W. Kocay, C. 
Dodson, S. Radziszowski, V. Rodl, R. Wilson, and Gy. Katona. 


In particular, Jack McIntosh offered the use of the word 'packability' for our very new Russian term 
‘vlozhimost". 


Our special thanks go to the translator. Having made attempts ourselves to translate combinatorial literature 
into Russian, we know how difficult it is to render correctly thoughts sometimes expressed in ponderous 
language or overloaded with meaning. However, we believe that even this paragraph has been successfully 
translated. 


We thank KLUWER Acad. Publ., who ventured to undertake this step, although we think that the risk has 
been substantially reduced by the superb coordination of the entire work done by Margaret Deignan, to 
whom we express our deepest gratitude. 


Preparation of the English version of the book was partially supported for the second author by the Russian 
Foundation for fundamental research with its Algebraic Combinatorics grant #93 — 011 - 1442. 
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Introduction to the English Edition 


As far as we know, this is the first Russian book on general combinatorics that has been translated into 
English. In recent decades, the reverse process has occurred: most Western monographs, conference 
proceedings, and some collections of papers on combinatorics have been translated into Russian and 
published in large printruns. 


In the post-War period, a vigorous development of combinatorial research has taken place in Russia on a 
two-fold basis: translated sources together with Russian books, conference proceedings and articles, and, 
finally, a journal specifically devoted to combinatorics. Russian combinatorists have been more fully 
informed than their Western colleagues. When reading our book one may get the feeling that we have 
devoted insufficient attention to foreign results. The Russian reader will know better: we exactly know its 
information sources and, candidly, have not feared the appearance of this translation. The book was written 
for Russian readers, with Russian objectives. 


One such objective was to attract young people to the thematics of extremal problems and to combinatorics 
as a field of study. Therefore, in part, the book has the features of both a textbook and a reference handbook 
and can be read by mathematics students and beginning engineers. We have happily seen this objective 
realized in that the work of one of these students is presented in this English version by advances on the 
Frobenius problem. 


Another objective is shown in our attempt to extend the extremal approach to solving a large class of 
problems including some previously regarded as exclusively algorithmic. Unfortunately, the problem 'P = 


NP' has, from time to time, baffled not just theorists. 


Related to this very objective is a third one (chronologically, the first): to broaden the freedom of choice of 
theoretical bases for modeling real phenomena in order to completely solve practical problems. 


The real phenomenon which prompted this whole project is this: if a large number of problems (say, 108) are 
simultaneously solved by a computer, ‘crowding’ (fragmentation) of its memory occurs, sharply increasing 
both the total time and the separate time for solving each problem. This is sometimes of essential and even 
fundamental importance, for example, in the detection and servicing (destruction) of a set of fast-flying 
targets. And if their fly-up time (say, to Moscow) is five to eight minutes, then every computer second 
gained turns into entirely concrete reality. 


The method is sufficiently universal, since every computer has a memory—even 
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the abacus, which, to this day, remains an unsurpassed configuration: it is simultaneously a carrier of 
memory, a processor, and a display, albeit necessarily with a person present. 


VI. BARANOV 
B.S. STECHKIN 
MOSCOW 

27 JANUARY 1993 
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Preface 


This book is the result of the close cooperation between an engineer and a mathematician to develop 
methods for solving problems arising in the creation of automatic control systems. The main result of this 
cooperation is a combinatorial model, packability of number partitions, expounded in this book. 


A study of packability of number partitions preceded the analysis of a number of practical problems arising 
in the design of efficient methods for controlling computer memory allocation, in the development of 
methods for the analysis of programming techniques for automatic control systems, etc. The choice of 
combinatorial methods was predetermined by the creation of the new, practically important, theme of 
extremal combinatorial problems on packability of number partitions. This combinatorial direction turned 
out to be useful not only in the formalization and solution of a number of problems in engineering, but also 
in solving a class of extremal problems on graphs. 


The aim of this book is to let engineers and mathematicians become acquainted with the methods developed 
by the authors for solving a number of applied and mathematical problems. The material of the book is 
divided into 5 chapters. 


Chapter | is a short guide to the necessary combinatorial notions. In particular, next to all elementary 
combinatorial schemes it explains a listing scheme developed by the authors, enabling us to unify the 
simplest combinatorial schemes. 


Chapter 2 contains the basic mathematical results in the study of the packability of number partitions, and 
presents the, up till now, most complete summary of results in this direction. As illustration of the 
applicability of these results we note their relation to old weighing problems and other problem statements. 
We give as exercises some problems and assertions concerning the packability of number partitions. 


Chapter 3 is devoted to getting acquainted with extremal problems on graphs and systems of sets. We show 
their relation with results on the packability of number partitions. 


In Chapter 4 we study certain extremal problems in geometry, and apply the results of having solved them. 


In Chapter 5 we give methods for using the results of solving extremal combinatorial problems concerning 
the packability of number partitions in the design of ACS. Here we give combinatorial models for studying 
processes to control the design of ACS and computer memory allocation. We demonstrate the applicability 
of the 
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packability theorem for computing the size of operation computer memory, and give definitions of a number 
of new concepts in engineering related to the application of methods of combinatorial analysis to the 
investigation of the performance of ACS. We propose a new method for estimating the efficiency of 
algorithms, characterized by extremal bounds. 


The authors express their gratitude to all experts who helped in obtaining the results exposed in this book. In 
particular, O.V. Viskov, R.L. Graham, Ya. Demetrovich, Gy. Katona, Yu.V. Matiyasevich, S.G. Sal'nikov, 
and P. Erdos. The authors also thank A.F. Sidorenko, for supplementing the material of Chapter 3 by results 
concerning forbidden subgraphs and Ramsey numbers, and for taking part in writing the first two sections of 
Chapter 4. The authors express deep gratitude to A.A. Gushchin, V.K. Krivoshchekov, and A.A. Tsypkin for 
their great help in writing computer programs for obtaining the numerical results in Chapter 2. 


Special thank of the authors goes to the reviewers, whose remarks not only helped in improving the book, 
but also influenced its structure. 
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Analysis Correspondence 


(A Historical Perspective on Combinatorics) 


The origins of systematic combinatorial research can be found in the works of 
Blaise Pascal and Pierre de Fermat. Questions about Chevalier de Méré’s game led 
to the distinction between individual combinations and the calculation of favorable 
outcomes. Three chapters of Jakob Bernoulli’s book ‘Ars Conjectandi’ are devoted 
to the first systematic exposition of combinatorial facts. The works of Bernoulli 
and Gottfried Leibniz helped to establish combinatorics as an independent subject. 
In particular, Leibniz was first to attempt a full understanding of combinatorics in 
his dissertation ‘Ars Combinatoria’, in which, apparently, the term ‘combinatorics’ 
originated. 

In the Russian mathematics the term ‘kombinatorika’ (combinatorics) was not 
immediately adopted, the preferred phrase being ‘teoriya soedinenii’ (theory of 
joinings /combinations), which completely reflects the essence of the subject. 

The basic objective notion in combinatorics is that of a correspondence. Com- 
binatorics is the analysis of correspondences among properties of objects so as to 
study these properties and/or objects. Such analysis is difficult on account of the 
interdependence of such properties. Combinatorics consists in the study of corre- 
spondences and combinations of elementary mathematical objects—numbers, sets, 
and figures. 

Methodologically, combinatorics is founded upon the three attributive properties 
of sets: distinctness, orderliness, and wholeness. Combining gives rise to all of the 
elementary combinatorial tools: distinctness—multiset, orderliness—permutation, 
and wholeness—partition. Objects of combinatorial study may include not only 
mathematical objects, but also any practical object, whether objects, people, ac- 
quaintanceship, or assertions. It is this very freedom in the selection of objects for 
investigation which provides the simplicity, accessibility, and practical significance 
of combinatorial results, and at times also their mystical breadth. 

In the second half of the 19th century, the fundamentals of the theory of com- 
binations began to be included in compulsory courses on algebra at high schools 
(gimnaziya) and technical schools in Russia and other countries. Deeper study 
of arrangements and combinations of objects was conducted in those branches of 
mathematics to which these objects belong: analysis, algebra, geometry, number 
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theory, set theory, and logic. This, in turn, was reflected in the specificity and di- 
versity of applications, and also in the formulation of the basic types of problems. 
Furthermore, all combinatorial topics are strongly interrelated, and unite in a single 
subject—-combinatorics. General combinatorics encompasses all of these topics. 

Towards the beginning of the 20th century, combinatorial analysis, as the mathe- 
matical analysis of functions of a discrete argument, ‘occupied the ground between 
algebra and higher arithmetic’, in the words of MacMahon, who also noted a ten- 
dency towards ‘combinatorial attack in other territories’. The more pronounced this 
process, the more effective the methods of combinatorics became as they acquired 
new methods in the course of this very expansion. Two factors influence combina- 
torial investigations and their development in separate directions and on different 
topics: the type of subject, i.e. the selected object of investigation; and the type of 
problem, i.e. the selected aim of investigation. The choice depends on the needs ob- 
served and the possibilities available. The development of subject, in turn, enriches 
both the needs and the possibilities. 

Elementary quantitative analysis of arrangements and combinations forms the 
basis of the traditional type of problem in combinatorics—enumeration. The devel- 
opment of enumeration is the main source of combinatorial analysis. The method 
of generating functions is commonly used in combinatorial analysis, and was histor- 
ically the first method used. Developed by Leonard Euler for the theory of number 
partitions, this analytical method also turned out to be powerful in combinatorics. 
It has been developed into refined forms such as the method of Dirichlet generat- 
ing functions, the method of trigonometric sums, and the method of characteristic 
functions: methods applicable not only in combinatorics and number theory. The 
method of generating functions was developed mainly for use in partition problems. 
One of the most brilliant moments in this development was the creation of the ‘circle 
method’, originally invented for counting the partitions of a given number. 

Another type of problem in combinatorics is structural in nature. Such are best 
known in graph theory. Graph theory is a branch of combinatorics dealing with 
another kind of elementary relation on sets and systems of sets. However, the 
emergence of this branch occurred at a time when correspondence and relation had 
not yet been distinguished as independent mathematical concepts, but appeared 
only in terms of other notions, mainly of a geometrical and topological nature. 


‘Iam not content with algebra, in that it yields neither the short- 
est proofs nor the most beautiful constructions of geometry. Conse- 
quently, in view of this I consider that we need yet another kind of 
analysis, geometric or linear, which deals directly with position, as 
algebra deals magnitude... Analysis situs. I think that I have at my 
disposal such a means, and that figures and even machines and mo- 
tions can be represented using symbols, as algebra represents numbers 
and magnitudes. ... I have yet to add a remark on the fact that I re- 
gard it possible to extend a characteristics to things inaccessible to 
perceptible imagination; but this is too important and too far fetched 
for me to be able to explain in this account in few words’. 


So wrote Leibniz to Christiaan Huygens on September 8, 1679. In this letter, using 
the example of a kind of geometrical approach, Leibniz is searching for a general 
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means for a formal operation with correspondences. The term ‘situs’ (position, 
location) itself may be understood as a correspondence between object and place. 
For the rest of his life, Leibniz did not abandon this dream. 15 years later he wrote 
to Hopital: 

‘... I would like to have the possibility of realizing it, but dry and 

abstract thoughts excite me too much.... I was often not healthy in 

the past year, so that for a long time I had to restrain myself, although 

I did not succeed in this to the extent I would have wished’. 
Leibniz’s dream was ahead of its time, but, as turned out, not far ahead An amusing 
topological brain-teaser, the walking tour of the seven bridges of Konigsberg, would 
be decisive. Euler derived necessary and sufficient conditions for the existence of 
such a walking tour in full generality, thereby lying the rudiments of graph theory. 
The original problem was as follows: Is it possible to walk across each bridge once 
and only once and return to the starting point? By representing the connected land 
regions by points and the bridges by lines, a graph can be drawn, and the question 
is that of the possibility of traversing the graph from point to point along the lines 
such that each edge is traversed once. On August 26, 1735, Euler presented a paper 
to the Academy of Sciences in St. Petersburg, ‘Solutio problematis ad geometriam 
situs pertinentis’. In it he established local conditions for the realizability of such 
a traversal, which is now called an Euler tour. A graph has an Euler tour if and 
only if it is connected and has an even number of edges at each vertex. The graph 
of the Konigsberg bridges does not satisfy this property. In the same paper Euler 
established that in any graph the sum of the degrees of the vertices equals twice the 
number of edges. Euler began his paper by referring to Leibniz as follows: 


‘In addition to that branch of geometry which is concerned with mag- 
nitudes and which has always received the greatest attention, there is 
another branch, previously almost unknown, which Leibniz first men- 
tioned, calling it the geometry of positions (Geometria Situs). This 
branch is concerned only with the determination of position and its 
properties; it does not involve measurements, or calculations made 
with them ...’. 
In this connection special significance attaches to an epistolary testimony of Euler 
(recently noted by A.E. Malykh and K.A. Rybnikov ). On March 13, 1736, Euler 
wrote to J. Marioni about the problem of the bridges: 
‘This problem, although commonplace, seems to me, however, worthy 
of attention, since for its solution neither geometry, nor algebra, nor 
the combinatorial art suffices. Therefore a thought arose within me, 
which is not only accidentally related to the geometry of position 
(Geometria Situs) which was investigated by Leibniz in his time’. 


This direct testimony indisputably stresses the inclusion within modern combina- 
torics not only of ‘Ars Combinatoria’, but also of ‘Analysis Situs’. Thus, the notion 
of a graph as a system of two-element subsets (edges) of a set (of vertices) arose 
and was studied on the basis of its topological nature. The graph planarity crite- 
rion derived by Kazimierz Kuratowski extended the conception of graphs: A graph 
can be drawn in the plane by points and non-intersecting line segments if and only 
if it does not contain a subgraph homeomorphic to the graphs Ks or K33. This 
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means that the ‘topological nature’ of a graph is completely determined by its set- 
theoretical structure. Therefore topological problems in graph theory from a sepa- 
rate subject matter, in particular, that of map coloring problems and problems on 
embedding graphs in manifolds. The graphical representation of arrangements and 
combinations by geometrical figures in combination with euclidean geometry led to 
the creation of the theory of matroids, and of combinatorial and of finite geometries. 

The highly abstract nature of algebra, logic, and set theory not only brought 
about their application to relations amongst objects of arbitrary kind, but also 
made possible the solution of problems having to do with the realization of concrete 
structural phenomena, thereby lying the foundation of yet another direction: the 
algorithmic. 

The characterization of the limiting possibilities of combinatorial relations forms 
the essence of yet another type, the extremal combinatorial problem, i.e., in general, 
the search for an answer to a question which can be stated in the words of Chebyshev: 


‘The major part of practical questions leads to problems involving 
largest or smallest magnitudes, which are totally new for science and 
only by solving these problems can satisfy the requirements of practice 
which always aim at the best, the most appropriate.... The coming 
together of theory and practice gives most favorable results, and not 
only practice gains by this; all of science develops under its influence, 
it discloses new subjects for investigation or new angles for considering 
long known subjects. Despite the high degree of development taken 
by science, practice clearly discloses the incompleteness of science in 
many respects; it poses problems which are essentially new for science, 
and in this manner calls for the search of totally new methods. If a 
theory can gain from new applications of old methods, or from new 
developments of it, then it gains even more by disclosing new methods, 
and in this case practice is a loyal guide of science.’ 


One of the first independent appearances of an extremal problem was geometrical, 
and dates back to 1611, when Johannes Kepler first described a means by which a 
sphere can e surrounded by twelve balls of the same radius such that all these balls 
touch the central sphere. 83 years later I. Newton and David Gregory quarreled 
about the question of how many balls of equal size one can distribute in this manner 
around a central sphere of the same radius. Newton asserted 12, and Gregory 
13. The solution of this problem (twelve is correct) has taken over 200 years, and 
the search for a simple proof continues today. Studying the corpescular model of 
the structure of matter, Mikhail Lomonosov gave estimates of the coefficients of 
contraction of matter based on a density comparison for various manners of filling 
space with unit balls. 

At approximately the same time, at the request of the Russian government, Chris- 
tian Goldbach successfully attempted a mathematical approach to code-breaking, 
for which he was complemented by Counsellor Bestuzhev: “There is a key for every- 
thing written in code, thanks to the art of Mr. Goldbach’ (does this not underline 
the term ‘key to a code’ ?). 

Many problems of present-day coding theory can be formulated as extremal ge- 
ometrical problems in Hamming space, e.g., the maximal cardinality of a uniform 
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code of weight k and distance a is equal to the maximum number of vectors of norm k 
in a Hamming space for which the difference of any pair has norm at least a; clearly, 
this is an analog of the contact number. Thus, even in the period of development of 
the topic of extremal geometrical problems, the sphere of its possible uses already 
began to take shape. One of the earliest extremal number-theoretical results is due 
to James Sylvester, whose theorem asserts: Let r and s be mutually prime natural 
numbers, and let n(r,s) be the largest integer n that cannot be represented in the 
form n = ar-+ bs, where a and 6 are non-negative integers. Then n(r,s) = rs—r—s. 
The same question for three or more mutually prime numbers is still open, and is 
called the Frobenius problem. 

Extending the domain of application of theoretical combinatorial results leads 
to the emergence of another important type of problem, combinatorial modeling. 
Here, the choice of the most suitable combinatorial treatment of applied problems 
is determined by finite collection of their solutions. The high degree of abstraction 
of each combinatorial model makes it possible to investigate by this model a certain 
well-defined number of processes or phenomena in various domains of knowledge. 
Hence, the unification of such models into complexes, followed by the determination 
of a way of finding correspondence rules between the models, which, in turn, will 
depend on the problems to be solved by this complex of models, substantially extends 
the domain of their applicability. 

In turn, this leads to another type of problem: the study of correspondences be- 
tween distinct models. The main aim of this type is the creation of unified complexes 
of combinatorial models suitable for the adequate description not only of specialized 
practical problems, but also of processes and phenomena belonging to particular 
domains of knowledge. 

We have to add a few comments about a relatively young topic, namely, algebraic 
combinatorics. This is first of all the study of compound structures (partial orders, 
binary relations and correspondences) by means of which one constructs in a suitable 
manner pure algebraic objects (incidence algebras, etc.). At first glance such alge- 
bras (introduced for arbitrary binary relations) may seem the ultimate in abstract 
constructions. It turns out that they are far from abstract, and Richard Stanley’s 
result serves as a first substantiation of this. Nowadays there is a whole field arising 
from Stanley’s theorem, which asserts that locally finite posets are isomorphic if and 
only if their incidence algebras are isomorphic (as abstract algebras). Thus, the inner 
structure of the original binary relation is uniquely determined by the correspond- 
ing algebra of functions. And this result holds for a large class of binary relations 
and algebras. Thus, the apparatus of incidence algebras may serve as a model for 
actual phenomena that can be adequately described by sets with binary relations. A 
concrete example is that of a combinatorial medium which includes such sets in the 
form of electronic networks and strategic analysis. Precisely the construction of al- 
gebras of functions over arbitrary binary relations has crucially extended the sphere 
of modeling actual phenomena, which previously had been restricted by the axioms 
of partial order. In particular, knowledge of the Mobius function of the incidence 
algebra over the binary relation provides full knowledge of the inner structure of this 
binary relation. The complexity of this structure corresponds to the complexity of 
this function and its forms, including forms which are nonstandard from the point 
of view of classical analysis. This young branch is so powerful that it merits more 
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than an equal place among the general topics listed above, as it also changes the 
inner structure of combinatorics as a whole. 

Combinatorics can serve as practice and as theory. In its formative period it 
proved practical for probability theory, confirming and suggesting its methods and 
laws; as theory it was outstandingly successful in solving problems. This remarkable 
duality also appears in extremal problems, which are not only a working tool for 
solving purely practical problems, but also characterize the efficiency of this solution, 
thus embodying a suitable standard of a basic criterion for truth: practice. The 
authors of this book concur with the thought expressed by Silvester: ‘Number, 
place, and combination are three mutually interbreeding, but excellent spheres of 
thought to which all mathematical ideas can be reduced’. Finally, therefore, we 
conclude that in combinatorics the notion of correspondence is as fundamental as 
that of magnitude in algebra, number in number theory, or figure in geometry. 
Thus, alongside algebra, number theory, and geometry, combinatorics will ultimately 
occupy one of the ‘atomic’ places in the structural unity of all of mathematics. 

The major part of this text is also our reaction to the paper of Robin J. Wil- 
son: ‘Analysis Situs’ (Graph Theory with Applications to Algorithms and Computer 
Science, Wiley, New York, 1985, pp. 789-800) and the articles on combinatorics by 
V.N. Sachkov in the Matematicheskaya Entsiklopediya (Nauka, Moscow, 1985) 
[English translation (revised and annotated: Encyclopaedia of Mathematics, Kluwer, 
Dordrecht, 1988]. In particular, we have had to reiterate some essential statements 
from these works and are thus grateful to the authors of these publications for the 
opportunity to continue the discourse on this theme. Also, we express much grati- 
tude to everyone who helped us write the final text in English: Rob Hoksbergen (the 
translator of the book), Kit Dodson, Bill Kocay, Luise Guy, and Jack McIntosh. 


CHAPTER 1 


Some information from combinatorics 


This chapter gives definitions of the combinatorial notions necessary in the book. 
For a deeper study of these, we refer to the specialized handbooks [15], [53], [55]-[57], 
(60|-[63], [94], {100], [101}. 


1. Sets and operations with sets 


1.1. The notions of a set and a multiset. A set is a whole, consisting of 
different parts. Of course, this description in words can hardly be regarded as a 
strict definition. The matter is that set is a categorial notion, not lending itself to 
a strict definition; its very nature allows various kinds of descriptions. The aim of 
these descriptions is to reflect the most important (attributive) properties of a set, 
in particular, distinctness of all parts of a set, nonorderability of the parts of a set, 
and the wholeness of a set. 

We distinguish two types of parts of a set: elements and subsets. An element 
is taken to be an indivisible and nonempty part of a set, all other parts are taken 
to be subsets. Each element of a set can be regarded as a one-element subset. 
We distinguish a particular part, called the empty set (i.e. it does not contain any 
element) and denote it by @. We assume that every set has such a part. 

Dropping the distinctness of elements of a set leads to the notion of multiset, i.e. 
a collection of elements some of which may be identical (indistinguishable). Each 
multiset can be represented by its basis, i.e. the set of all its different elements, and 
multiplicities, the numbers of repetition of each element of the basis of this multiset. 

A handful of coins can be regarded as a set and as a multiset: If it contains coins 
with the same value, then these are indistinguishable as regards expenditure, i.e. 
they give a multiset, whereas at the same time, a numismatist may be interested 
in the dates on the coins, and if the dates are different on coins of the same value, 
then the handful of coins forms a set. 


1.2. Notations. If a is an element of a set A, we say that a belongs to A and 
write a € A; in the opposite case we write a ¢ A. If B is a subset of A we write 
BCA. The relation C of inclusion of subsets is reflexive (A C A) and transitive 
(if BC Aand ACC, then BCC). If AC Band B CA, then A= B. A subset 
B is said to be a proper subset of A if B C A and A # B. In such a case we write 
BCA. 


8 1. SOME INFORMATION FROM COMBINATORICS 


The simplest numerical characteristic of a set as a whole is the amount of elements 
in it, ie. the cardinality of a set. A set A is said to be finite if its cardinality is 
a nonnegative integer, denoted by |A|. If the number of elements of a set is not 
bounded, the set is said to be infinite. Let |A| = n and |B| = m. If B C A, then 
m <n; moreover, if B Cc A, then m <n. 

We can identify a set by its list of elements, A = {a),@2,...}, where the order of 
the a,’s is immaterial. However, such an explicit specification of a set cannot always 
be given, or doing so may be inconvenient, e.g., the set N of natural numbers does 
not have an explicit listing, since it is infinite. In such cases, a set may be given 
by a description of the properties which uniquely determine whether elements do or 
do not belong to the set. This manner of specifying a set A can be correspondingly 
written as A = {a: a has property R}, meaning that A consists of only those a that 
have the property R(a) = R, e.g., if R(a) consists of the fact that @ is a prime number 
(i.e. @ cannot be represented as the product of two terms different from it and 1), 
then A is the set of all prime numbers. We can also specify a set recursively, when 
the next element is described in terms of previous ones. E.g., such a description of 
the set of natural numbers is 


N = {i: if the integer i ¢ N, theni+1eN,i>1,1eEN}, 
and such a description for the set of Fibonacci numbers is 
F = {pi : py = Di-1 + Di-2, Pi = Po = 1, 1 = 3, 4,... f. 


The means for specifying a multiset are similar to those for specifying a set. E.g., 
the multiset A = {a,a, b,b, b,c} has base {a,b,c} and multiplicities k(a) = 2, k(b) = 
3, k(c) = 1. The multiplicities of elements of the base of a multiset are sometimes 
written as supscripts; specifying A then corresponds to writing A = {a?,b°,c'}. The 
list of multiplicities of a multiset A = {a’,b”,...} is called its first specification and 
is denoted by [A] = [v,w,...]. According to this definition, the first specification 
can also be a multiset, and consists of natural numbers. The second specification of 
the multiset A = {a’,b”,...} is the first specification of its first specification, i.e. 
([A]] = [[v,w,...]]. This implies that if A is a set of m elements, then [A] = [1], 
[[A]} = ([2"]] = {rm}. 

In conclusion we note that every specification of a set must be correct. Nonob- 
servance of this may lead to difficulties of the type of the Russell paradox. This 
paradox is usually illustrated by the example of the barber, defining the set of peo- 
ple he shaves as the set of all inhabitants of his village who do not shave themselves. 
In this specification of a set, it remains unclear whether or not the barber himself 
belongs to the set. Hence, any means of specifying a set must ensure its wholeness, 
whether given by elements, subsets, using operations, etc. 


1.3. Operations with sets. The intersection of two sets X and Y is the set 
XY of all elements that belong to both X and Y, ie. X NY = {x: x € X and 
xeY}. Eg., for X = {1,2,3}, Y = {2,3,4}, X NY = {2,3}, while for A = {1, 2}, 
B = {3,4}, AN B = 0 (such sets are said to be disjoint). Clearly, X 1@ = @. The 
intersection of two or more sets is commutative: X NY = Y 1X, and associative: 


(XNY)NZ=aXN(YNZ=XNYNZ. 
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The union of two sets X and Y is the set X UY of all elements that belong 
to- either X or ¥Y;ie.X UY = {a @ <€ X -org-e YY}, Eg:, if X = {1,2,3}, 
Y = {2,3,4}, then X UY = {1,2,3,4}; clearly, X U@ = X. The union of two or 
more sets is commutative: X UY = Y U X, and associative: 


(KUY)JUZ=XU(YUZ)H=XUYuZ. 


The operations of union and intersection have the important property of distribu- 
tivity: 


(XAVYUZaSOCAVYIUAAZ, (Kvuvyjnz]=AuY)aceu 2). 


The difference of two sets X and Y is the set X \ Y of all elements of X that 
do not belong to Y, ie. X \Y = {a: cE X anda ¢ Y}. Evg., if X = {1,2, 3}, 
Y = {2,3,4}, then X \ Y = {1}; clearly, X \@= X and 0\ X = 9. The definition 
of difference implies that (X \Y)U(X NY) =X. 

The symmetric difference of two sets X and Y is the set X A Y of all elements 
of X that do not belong to Y and of all elements of Y that do not belong to X, 
ie. X AY ={a: xe X andx¢ Yorxe Y andz¢ X}. Eg, if X = {1,2,3}, 
Y = {2,3,4}, then X A Y = {1,4}; clearly, 0A X = X A @ =X. The definition of 
symmetric difference implies that X AY = XUY\XNY. 

The complement of a set Y with respect to (in) a set X is defined only if Y C X, 
and then it is the set Y = X \Y. Evg., if Y = {2,3}, X = {1,2,3}, then the 
complement of Y in X is the set 


YoaX\Y S41), 


The de Morgan laws: if X and Y are subsets of a set Z, then (X NY) = X UY, 
(XUY) = XN Y. 

Sets X,, X2,... form a cover of a set X if X C U;X;; in this case the X; are 
called blocks of the cover. E.g., a cover of the set of natural numbers is given by 
{12s . zy Cc Ussa {044 ot ie 

A partition of a set X is a representation by disjoint subsets of it: X = X;UX2U 
5 XENX;,= 00479). Eg., {1,2,...} = UM, {2}. The sets X; are called blocks 
or parts of the partition. If the number of blocks is finite, it is called the rank of the 
partition. We can conveniently describe a partition by the list of its blocks, since, 
by definition, this list uniquely determines the partition; therefore the list is also 
called a partition. E.g., for X = {a,b,c} the notation (a,bc) denotes the partition 
of X into two parts, a and bc, separated from each other by a comma. 

The specification or type of a partition X = X,;U X2U---UX, is the list of 
cardinalities of its blocks, [|Xq],|Xo|,...,|X;,|]. E.g., the partition (a,bc) has type 
[1,2]. A subpartition (or splitting) of a partition is a partition obtained by a partition 
of the blocks of the initial partition. E.g., the partition (a,b,c) is a subpartition of 
(a, bc). In other words, by taking unions of blocks in a subpartition we can always 
‘glue together’ the initial partition. Finally, we distinguish between ordered and 
nonordered partitions, depending on whether or not the sequence of blocks is taken 
into account, where, moreover, all possible specifications distinct from the usual 
(nonordered) partition are agreed to be special. 
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The sum rule follows from the definition of partition of a set: for each partition 
of a finite set X = X, U---UX,, XN X; = 0 (i £7), the following equality holds: 


|X] = |Xil +--+ + [Xel- 


The generalized sum rule holds for a cover of a finite set X C X, U---UX,, and 

has the form: 
IX] < [Xa] +--+ + [Xl 

The product of sets X,,...,X, is the set J];_, X; = X1-X2-...- X, consisting 
of all ordered tuples (x1,272,...,2,) where x; € X; (¢ = 1,2,...,r). This product 
of sets is also called direct or Cartesian. If X = {1,2} and Y = {2,3}, then 
X-Y = {(1,2), (1, 3), (2, 2), (2,3)}. Hence, each element (21,...,2,) € Tf, Xi of 
the direct product can be regarded as an r-dimensional vector, where x; € X; is the 
ith coordinate of this vector (i = 1,2,...,r). It is convenient to set X -@ = 0. The 
Cartesian product of n factors X,..., X is called the nth Cartesian power of X and 
is denoted by X™. E.g., if X = {1,2}, then 


pC et Ba ry PARI) ie le FB el oP a a ee ee) ee) ea 


The product rule (which plays an important role in enumeration combinatorial 
problems): for arbitrary finite sets X1, X2,..., Xn the following equality holds: 


Dee con el Se el cae el: 


The Boolean is the set of all subsets of X, including the empty set 0 and the set 
X itself. Thus, the elements of the Boolean, regarded as a set, are the subsets of 
X. E.g., the Boolean of X = {1, 2,3} consists of the sets {0}, {1}, {2}, {3}, {1, 2}, 
{1,3}, {2,3}, {1,2,3}. We denote the Boolean by 2* or P(X); the notation 2* is 
used in connection with the fact that for X finite, |2*| = 2!*!. In the Boolean, we can 
naturally distinguish the subsets (consisting of subsets of X) of equal cardinality: 
Ch(X) = {S CX: |S| = k}. Clearly, with this notation P(X) = UXC*(X). 
The sets C*(X) have cardinalities equal to the values of a binomial coefficient: if 


|X| =n, then 
ke = ko nm _ n! 
rears (") k(n —k) 


A graph on a set of vertices S, = {@1,...,@n} is any subset G of the set C?(S;,), 
so that the elements of a graph G C C?(S,,) are the two-element subsets of S,,; these 
are called the edges of the graph G. Thus, each graph on a set of vertices S, = 
{@1,.-.,@,} can be represented by the list of its edges, G = {(a;, aj), (ax, @),-.- }, 
where (a;,a;) € G if and only if the vertices a; and a; are joined by an edge of G. 
Consequently, every pair (a;,a;) in such a list can be interpreted as an edge. 

The complete graph is the graph K, = C?(S,), so that |K,| = (5). 

A cycle is a graph of the form G = {(a1, a2), (@2,@3),---, (@k—1, @k), (@x,a1)}. A 
cycle is usually denoted by C;; clearly |C,| = k. 

A path is a graph of the form G = {(a),@2), (@2,a3),.--, (@x-1, @x)}. A path is 
usually denoted by P,; clearly |P,| = k — 1. 

Usually, graphs are graphically depicted: the vertices of S, are points, and the 
edges are lines joining the pairs of vertices forming an edge of the graph. F.g., in Fig- 
ure 1.1 we have depicted the graph G = {(a1, a2), (@1, @4), (@2,@3), (@2, a4), (a3, as) } 
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a5 
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FIGURE 1.1 


(n = 5). This graph contains the complete subgraph Ks3 (it is also the cycle C3) on 
the three vertices a,, @2,a4 and a path Ps, e.g., the path passing successively through 
5,43, 42, @1, a4. 

There are many different modifications of graphs. 

The directed graph: the edges of G are ordered pairs of vertices. 

The multigraph: the edges of G may repeat. 

The hypergraph. A hypergraph on a set of vertices S, = {@1,...,@n} is any subset 
G of P(S,,), so that the elements of a hypergraph G C P(S,,) are subsets of S,, called 
hyperedges of G. Consequently, hyperedges of a hypergraph G may have cardinality 
larger than 2. 

The k-uniform hypergraph, or k-graph: all edges of G have cardinality k. 

The following are important numerical characteristics of graphs: 


e The verter degree: if a € Sp, then dg(a) = |{e € G: a € e}|, ie. the vertex 
degree is the number of graph edges containing this vertex, in other words, 
are incident to this vertex. 

e The valency: for vertex set S and a nonnegative integer g, the valency 
v(S,q, G) = |{e € G: |SNe| = g}| is the number of graph edges intersecting S 
in a fixed number q of vertices. Clearly, v(a, 1,G) = dg(a). Euler established 
that for each graph the degrees satisfy the identity }>7_, de(ai) = 2|G]. 

An ordered partition is a partition in which the sequence of blocks does matter. 
E.g., if X = {a,b,c}, then all ordered partitions of X are 


with one block: (abc); 
with two blocks: (a, bc), (b, ac), (c, ab), 
: (bc, a), (ac, b), (ab, c); 
with three blocks: (a, b,c), (a,c, }), (c,a, b), 
: (b, a,c), (b,c, a), (c, b, a). 
We denote the set of all ordered partitions of a set X by TX), and its cardinality 
by T(|X|). We denote by T*(X) the set of ordered partitions consisting of k blocks, 
and by T*(|X|) the cardinality of this set. Now, if |X| =n, then 


TXy\= U T*(X),  T(n)= So T*(n). 
k=1 k=1 


As before, the notion of type is well-defined for ordered partitions, as the sequence 
of volumes of the blocks; therefore we denote by T'[ni,...,nr] the set of ordered 
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partitions of type [n1,..., mr], ie. with blocks of volumes ny,...,n,, respectively. 
E.g., the above described set of ordered partitions of X = {a,b,c} with two blocks 
consists of the sets 71,2] and T[2,1], having 3 (= T(1, 2) = T(2,1)) partitions in 
each of them. We denote the cardinality of the set T[n,,...,n,] by T(m1,..., mr). 
Now, if |X| =n = S}_, m, then 


P(X) = Un, perry 
Pas. OSs “UG tignas oe): 
(nj,....Nr) 


nz)L (ni, <a , Nr], 


Here, summation and union is over all types of partitions of rank r. 
These numerical characteristics of ordered partitions may be calculated using the 
following formulas: 


| ! 
Gi sel Ga ( ia = be 


) 
1, 2,.++5 Mr N1!Ng!...n,! 


where n!/(n,! nq!...n,!) is the multinomial coefficient; 


n! Tr 
[Tr = _ | r—k k: 
(n) ei ec nNy!No!... ny! 2.( ) (;) 

n! 7 
nit--tnp=n ny! nN! aoc Nr! 
ni>0 
The Bellean is the set of all partitions of X. E.g., if X = {a,b,c}, then the 
Bellean of X consists of the partitions 


rank one: (abc); 

rank two: (a, bc), (6, ac), (c, ab); 

rank three: (a,b), c). 
Here the blocks are separated by commas. We assume that the partition blocks 
in a Bellean are not ordered, i.e. the partitions (c,ab) and (ab,c) are taken to be 
identical. We denote the Bellean by B(X), its cardinality by B(|X|), the set of 
all partitions with precisely k blocks by B*(X), and the cardinality of this set by 
B*(|X|). Thus, if |X| =n, then 


B(X) = U BCX) B(n) = 3 B¥(n). 
k=1 k=1 


We denote the set of partitions of type [nj,...,n,] by Bln,...,n,], and the 
number of partitions of a set X (|X| = n) of type [m,...,n,], where n = S7}_1 ni, 
by B(n,...,n,). Thus, 

BMX) =UB(Xi,...,X), Bn) = Bim...) 


where summation and union are over all possible types of partitions into r blocks. 


1. SETS AND OPERATIONS WITH SETS 13 


We can calculate these numerical characteristics of the Bellean using the following 
formulas: 


TG ecerte 

Bin, Ny) = ae _ 
: T'(n 
Br(n) = 0). 


B¥(n) = o(n,k), where a(n, k) is the Stirling number of the second kind, 


k = k ik 
ee (()S. pale 2.) on: 
o(0,0) = 1, o(n,k)=0 (n<k), 
= 5 o(n, kale —1)...(e— b+ 1), 


k=0 


B(n) is Bell’s number, 
Bln) = Slr) (=) Ce 
r=0 j=0 Y, 
Bin+1)= 3 (") Bin —1), 
r=0 r 
B(n) = 4 


=F (Dobinski’s formula). 
r=0 °* 


1.4. Operations with multisets. For multisets we can introduce an addition 
operation which has no analog in classical set theory. 

Addition of multisets. Suppose we are given multisets A and B: A has basis 
S(A) = {z,y,z,... } and multiplicities [k4(x), ka(y), ka(z),.-.]; B has basis S(B) = 
{x,y,z,...} and multiplicities [kg(x), ke(y), ke(z),...]. Then the sum (A+B) of A 
and B is defined as the multiset with basis S(A+B) = S(A)US(B) and multiplicities 


[kat p(X), kate (y), kay a(2),---] = 
= [ka(x) + kp(x), ka(y) + ka(y), ka(z) + kp(z),---], 


i.e. under addition of multisets we take the union of their bases and add their 
multiplicities. E.g., if A = {a?,b?,c!}, B = {a!,c°,d*}, then A+B = {a3,b’,c®, d‘}. 
Here, of course, elements absent in one basis but present in the other are interpreted 
to have multiplicity zero in this basis. 

The definition of summation of multisets immediately implies a rule for computing 
the cardinality of their sum: if A and B are finite multisets, then 


|A+Bl=|A|+|Bl= D7 kala) + D7 kal), 
a€S(A) bES(B) 
so that in the previous example |A + B| = (2+3+4+1)+(1+5+44) = 16. 
Submultisets. We say that a multiset B with basis S(B) is a submultiset of 
a multiset A with basis S(A) if S(B) C S(B) and for each a € S(B) we have 
kp(a) < ka(a). 
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We denote inclusion of multisets by the same sign as for sets. E.g., if A = 
{a?,b",c!}, B = {a!,b°}, then B C A, since S(B) = {a,b} C {a,b,c} = S(A) and 
kp(a) =1<2=k,(a), kp(b) =5< 7= ka(b). 

Operating with multisets. The operation of summation provides a very conve- 
nient technique for operating with sets and multisets, similar to the usual operation 
with numbers. Next to the summation of multisets, an important role is played in 
this technique by yet another notion, the wholeness operator which enables us to 
analytically operate with collections as with a wholeness. 

Similarly as done for the Boolean, we take into consideration the set C*(A) = {B: 
BC A, |B| = k} of all k-element submultisets of a finite multiset A. E.g., if 
A = {a?,b*} and k = 3, then C%(A) consists of three multisets: {a?,b}, {a, b?}, 
{3}. 

The wholeness operator of a multiset A is the representation of A as a single 
element: C(A) = C'4I(A). E.g., if A = {a?,b?}, then C(A) = (a,a,b,6,b). In 
case A = S, = {@1,.-.-,@n}, ie. A is a set, the wholeness operator C(S,r) # Sh, 
since according to the definition of C(S,) its cardinality |C(S,)| = (”) = 1, while 
|S, | = n. Thus, in essence the wholeness operator of a multiset A is the consideration 
of A as an integer (whole), and always |C(A)| = 1. From the above-said it is clear 
that if k is a nonnegative integer, then the notation kC(a) must be understood as 
the k-times repetition of the element a. 

For each multiset A the following equations hold (and are axiomatized): 


A=J°C(a)= D> kala)C(a); 


acA a€S(A) 


C(A) = JT] Clo). 


ac€A 


Here the product is to be understood as the usual product of sets. Precisely these 
equations allow us to formally operate with multisets. E.g., using them the above 
introduced operation of summation of multisets takes the following simple form: 


A+B= S> (ka(a) + kp(a))C(a). 


a€S(A)US(B) 


This immediately implies the above given formula for the cardinality of a sum of 
multisets: 


|A + Bl 


>» — (ka(a) + kp(a))C(a)| = 


a€S(A)US(B) 


> — |(ka(a) + kp(a))C(a)| = 


a€S(A)US(B) 


>» — (ka(a) + kp(a))|C(a)| = 


a€S(A)US(B) 


S> (ka(a) + kp(a)). 


a€S(A)US(B) 


I 


Moreover, the operations of intersection and union of multisets can be described 
in a very simple manner. For this we introduce the notation A = min, V = max. 
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Then 
ANB= > (kala) Aka(a))C(a), 
a€S(A)NS(B) 
AUB= S- (k4(a) V kp(a))C(a). 
a€S(A)US(B) 


We will define the product of multisets in such a way that the following product 
rule holds: 


if A and B are finite multisets, then |AB| = |A]- |B|. 


Starting from this requirement we assume that if A and B are multisets, then 
their product is the multiset 


(A- B) = 45 ¥5 C(C(a) + C()). 
a€AbeEB 
From this definition we obtain, as required: 


|JA- Bl = |S) do C(C(a) + C0) = 


a€A bEB 


dD» I€(C(a) + C(o))| = 


a€AbEB 


E51 (£1) (E31) - 141-10 
a€A bEB acA beEB 

E.g., if A = {a?,b'}, B = {a',b*}, then their product eh of nine pairs of 
elements: A- B = {(a,a), (a,b), (a,b), (b, a), (b,b)} = {(a, a), (a, b)4, (6, a)!, (b,b)?}. 
Another example of a product of multisets is the ordinary product of natural num- 
bers, since each natural number can be represented as the multiset consisting of the 
units, i.e. n has basis {1} and multiplicity n. 

The Boolean of a multiset. Let A be the finite multiset with basis S(A) = 
{@),@2,...,a@,} and first specification [k,,k2,...,k,|, ie. the element a; occurs k; 
times in A (i = 1,2,...,r) and the cardinality of this multiset is |A| = S77_, ki = n, 
so that 


A= 3° C(a) = 3S ka(a)C(a) = 5 kiC(ai). 
acA ac€A i=1 
The Boolean of A is the set of its submultisets, including the empty set and the 
multiset A itself. We denote this Boolean by P(A). From the definition of C*(A) 
we have P(A) = UZ_yC*(A) = SZ, C*(A), hence, using the technique of operating 
with multisets we obtain 


n n ky or 
P(A=SC(AN=> DY eo le 
k=-0 k=0 j=0 j=0i=1 


Rite tir= 
7 => 35 I Ci(kiC(a:)) = I] y Ci(k:C(a)) = TT P&C(a)) 


Thus, the Boolean of a multiset can be represented as the direct product of the 
Booleans P(k;C(a;)) of multisets consisting of the single elements a;, repeated k; 
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times. The Boolean of such a multiset obviously consists of the k; + 1 submultisets 
{0}, {a;}, {a;,a;},..., {at*}, ie. [P(kiC(ax))| = ki + 1. Consequently, 

II P(kiC(a;)) 
t=1 


r r 


= TI IP k:C(a))| = T] (ei + 2. 


i=1 i=1 


|P(A)| = 


For the set A = S, = {a1,...,@n} this formula gives the formula |P(S,)| = 2”, 
which is already known to us. Moreover, it is useful to note the case A = kS, = 
f{ak,..., ak}, for which we have 


IC*(kS,)| = Ge '). 


k 


2. Correspondences between sets 


2.1. A correspondence between sets X and Y is any subset Z C X x Y given in 
advance. If (x,y) € Z, then we say that the element y corresponds to the element 
x, or that the elements x and y are in correspondence Z, and we write xZy or 
Z(x,y). The element y is called the image of x, and z is called the pre-image (or 
inverse image) of y under the correspondence Z. If, on the other hand, (x,y) ¢ Z, 
we write rZy. E.g., if X = {1,2,3}, ¥Y = {3,4,5} and the correspondence Z 
consists of the fact that x + y is a prime number (x € X, y € Y), then Z = 
{(1, 4), (2, 3), (2, 5), (3, 4)}. Here the elements 1 and 3 of X have the same image 
(namely 4), while 2 € X has two images (namely, 3 and 5); similarly, 3 and 5 in Y 
have the same pre-image (2 and 2), while 4 € Y has two pre-images (1 and 3). 

A subset Z C J]7_, Xi is called an n-place correspondence between the sets X; (¢ = 
1,2,...,n). Thus, each vector (x), %2,...,2%n) can be regarded as an element of a 
certain n-place correspondence. In particular, this shows that a correspondence can 
be geometrically given, by depicting the corresponding set of vectors in the Cartesian 
product of the sets. Other means for specifying correspondences are also useful: 
graphically and tabularly. We consider these three means in concrete examples. 
The correspondence Z = {x + y are prime numbers} is represented in Figure 1.2. 
Here a given point with coordinates (x,y) means that (x,y) € Z. For the same 
X and Y, suppose the correspondence Z C XY is given by the rule: (x,y) € Z 
if and only if x + y is even. The geometrical (a), graphical (b), and tabular (c) 
specifications of Z are given in Figure 1.3. It is clear from Figure 1.3 that in the 
geometrical specification of Z the relation (x,y) € Z means that a point in the 
plane is drawn; in the graphical specification a line segment is drawn; and in the 
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FIGURE 1.3. abe 
tabular specification a 1 is placed. This table is called the incidence matrix of the 
correspondence. 

The complete image of an element x € X under a correspondence Z C XY is 
the subset {y: y € Y, xZy} C Y; similarly, the complete pre-image of an element 
y € Y under a correspondence Z C XY is the subset {7: x € X, rZy} CX. E.g., 
for the correspondence in Figure 1.2 it is clear that the complete pre-image of 5 
consists of the single element 2, and the complete pre-image of 4 consists of the two 
elements 1 and 3. We can use correspondences to specify multisets. E.g., the multiset 
A = {a?,b',c*,d?} can be given by the correspondence 7 C {a,b,c,d}{1, 2,3, 4} 
defined by the rule: (z,n) € Z = ka(x) = n, ie. when A contains exactly n copies 
of x, where x € {a,b,c,d} and n€ {1,2, 3, 4}. 


2.2. A map ¢ from a set X into a set Y is a correspondence Z C XY such that 
for each x € X there is at most one y € Y for which xZy. This correspondence 
between x and y under ¢ is written as the equality y = ¢(x), and in this case the 
map has corresponding notation @: X — Y. The set X' C X of those x € X for 
which there is exactly one y € Y such that xZy is called the domain of definition of 
d, and the set Y’ C Y of y € Y for which there are x € X such that xZy is called the 
range of values of @. If X; = X, Y; = Y, and ¢(x) = ¢1(x) for all x € X, then we 
say that the maps ¢ and @ coincide, i.e. ¢: X — Y is equal to gd): X,; — Yy. Evg., 
if the sets X = {2,3,4} and Y = {3,4,5} are given, and the map ¢: X — Y assigns 
to an element of X a multiple of it in Y, then the correspondence Z corresponding 
to dis Z = {(2, 4), (3, 3), (4,4)}. A direct verification shows that Z is in fact a map, 
since each element in X is set to correspond to a unique element of Y, (2,3, 4) go 
to (4,3, 4), respectively. We have to make the following remark: if the rule defining 
@ is extended to a larger subset of the integers, then it is easily seen that Z need 
not always specify a map, since there may be elements in X having more than one 
image. In the example above, the domain of definition is the whole set X, and the 
range of values is Y’ = {3, 4}. 

The set @ '(y) = {x: y = o(x), x € X} is called the complete pre-image of the 
element y under the map @. 
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Let X = {21,22,...,2n}. Then ¢: X — Y can be represented as 


o= (ye X2, sey In 
- g(x), o(£2), oes) (Zn) 
where $(2;) € Y (i= 1,2,...,n). E.g., if X = {1,2,3,4,5}, Y = {a,b}, and 
12, Or ir. 5 
P= & is Dy.. 20s 2), 
then the complete pre-image of a is ¢~!(a) = {1,2,5}, and the complete pre-image 
of b is #-'(b) = {3, 4}. 

The set d(X) = {d(x) C Y: x € X} is called the complete image of the domain 
of definition of the map @¢. 

If ¢: X — Y is such that ¢(X) = Y, then we say that @¢ maps X onto Y; in 
this case there is for each y € Y an x € X such that y = (x), and the condition 
@ '(y) #4 holds. For finite X and Y the equality 6(X) = Y means that |X| > |Y]. 

If 6: X — Y is such that for the complete pre-image of each y € Y we have 
ld! (y)| < 1, then for finite X and Y we have |X| < |Y]|. If we have |@~!(y)| = 1 for 
all y € Y, i.e. for each y € Y the condition y = ¢(x) determines a unique element 
x € X, then we say that ¢ sets up a bayective correspondence between X and Y. For 
finite sets X and Y we then have |X| = |Y]. In this case ¢ is called a bijective map 
or bijection. E.g., Let X = {1,2,3}, Y = {a,b,c}, and let ¢@ be such that (1) = b, 
(2) = c, (3) = a, then ¢ is a bijection between X and Y. 

A permutation of a finite set is a bijection of this set onto itself. E.g., if X = 


{1, 2,3}, then 
L238 
Ob e 


is a permutation. If |X| =n, then the number of all permutations of the Y is 
nels 25. 


In fact, the first element can be chosen from any of the n elements of X , the second 
from the remaining n — 1, the third from the remaining n — 2, etc. Permutations 
can be multiplied by the rule 


Wi ee sate EVs Ss 22s “heskee® OPN OI) See a. ae 
41 ta +++ tr} \N Jo +++ Jn Jan Jin ++ Sin) 


This multiplication corresponds to superposition of the maps corresponding to the 
permutations to be multiplied; in other words, to successive application of these two 
maps. This means that if the map 


Len eue: FE 
a= (; Qi up: a 


puts the elements (1, 2,...,n) in the order (i,, 72,...,%,), and the map 


ee oe eae |) 
a= (; 39) ies _) 
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puts the elements (1, 2,...,n) in the order (J), j2,---,jn), then the map ¢i¢; first 
puts the elements (1, 2,...,) in the order (71, 72,...,2,), and then puts this order 
in the order (4i,,Jix,---)Ji,). E-g., if X = {1,2,3} and 


Lea 3 ee 8 
= (; ii i b= (5 9) an 


i<0:3 
60s = (j 3 a 


There is a unit permutation, not displacing any element: 


pia ete a 
NT SD cere PEP 


It is easily verified that for every permutation ¢ we have ed = ge = ¢. There are 
also permutations not leaving any element in place, e.g. the permutation 


12 ...n-1sn 
De A hee8 n 1 


represents a shift, or is cyclic. 

The restriction of a map @: X — X toa subset Y C X is the map ¢: Y — X, 
i.e. the same map ¢, but on a smaller domain of definition. The restriction of ¢ to 
Y C X is called a cycle if 6(Y) = Y and for some partition Y = Y; UY2, Y;N¥2 = Q, 
Y; 4 @,2= 1,2, there is an element y € Y, such that $(y) € Yo; in this case Y; is 
called an orbit of the cycle. 

A permutation ¢ acting on a set X = {2,2%2,...} is called a transposition if it 
leaves all elements, except two, in place, while these two elements change places. 


E.g., the permutation 
123 4 5 6 
lL 2.8 423-6 


is a transposition on the set {1,2,3,4,5,6}. Thus, a transposition has a cycle of 
cardinality two; in this case it is the cycle 


3 5 
ae 9 
If an orbit of a cycle consist of a single element, this element is said to be fized. 
For cycles 
Me ete 14 ove 1g eee Lp eee TL 
we use the convenient row notation (7)22...7,), while for n = 1, (%;) = e. E.g, if 


X = {1,2,3,4,5,6} and 
ey 0 ee ee ec 
ee Oo Ae 2 AG. 6 


then the restriction of ¢ to Y = {1, 2,3} has the form 


de 2 23 
a. 2 


then 
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and is a cycle, the element 4 is fixed, and the cycle 


5 6 
6 5 
is a transposition. 


Thus, each permutation can be represented as the product of its cycles, and 
generates a partition of X into orbits. In turn, each cycle can be represented as a 
product of transpositions, e.g. 


123 4 5 6 
é 124 6 5) = 03156) = 13/1169) 


A rearrangement of a finite set is the complete image of a bijection of this set onto 
itself. E.g., in the previous example, (312465) is a rearrangement, i.e. the lower 
row of a permutation is a rearrangement if the order in the upper row is fixed. 

Two graphs G(S,,) and G’(S,,) with vertex set S, are said to be isomorphic if there 
is an enumeration of the vertices of one such that the lists of their edges coincide. 
E.g., the graphs G = {(a),@2), (@1,a4)} and G’ = {(da2, a4), (a2, a3)} are isomorphic, 
since the enumeration of the vertices of the first graph given by the permutation 
(123) makes the first graph identical to the second. Since each enumeration of 
vertices is uniquely determined by a permutation, we also say that the graphs C'(S,,) 
and G'(S,,) with vertex set S, are isomorphic if there is a permutation 7 of S, 
satisfying 

G(Sp) ae G'(m(Sp)). 


A sequence of objects of whatever kind is a map from the set of natural numbers 
into the set of all these objects. E.g., {1,3,5,7,9,...} is the sequence of all odd 
numbers, and the map consists of putting 1 at the first place, 3 at the second, 5 at 
the third, etc. Passing to the two-row notation, this map can be written as 


Ls 2, 3, 4, 5, 
bs 53 2, le 29: 


Thus, a ‘sequence’ supposes an ordered list of elements, representing a functional 
dependence between its elements and the natural numbers. In the example given 
this dependence is ¢(n) = 2n — 1. The notion of map allows us to introduce the 
notions of operation and function. 


2.3. Operations. We say that an n-place operation X is given on a set X if we 
are given a map A: X() — X which associates to a vector (%1,%2,.--,2n) € X™ a 
single, unique element z € X. We write this as 


CP ACs she): 


Two-place, or binary, operations are the most widespread. A binary operation on 
a set X is a rule associating to an element of X‘?) at most one element of X. Binary 
operations are usually given a special notation, and in the general case we will write 
L=- 41T x2. E.g., if the addition operation is given on the set X = {1,2,3}, then 
to only two pairs of elements an element of X is associated (1 + 1 = 2, 1 +2 = 3), 
since the sums of all other pairs do not belong to X. 


2. CORRESPONDENCES BETWEEN SETS 21 


We say that a set X is closed under a binary operation given on it if to each 
element of X) is associated an element of X. E.g., if X = N = {1,2,...} is the 
set of natural numbers and T' = (+) is the addition operation, then the result of 
this operation is the sum x = 2; + X2, which obviously also belongs to N = X. 
This means that the set of natural numbers is closed under the addition operation. 
We can similarly be convinced that it is closed under multiplication, and not closed 
under the operations of subtraction and division. In this way we can represent each 
binary operation on a set by a ternary, i.e. three-placed, correspondence on this set. 

A binary operation T' on a set X is said to be 


associative: if for any x,y € X the following condition holds: 
(xT y)Tz = xT(yTz); 
commutative: if for any z,y € X the following condition holds: 
xrTy = yTx; 


distributive with respect to an operation O: if for any x,y,z € X the 
following equalities hold: 


xT (yOz) = (xT y)O(«T2z), (yOz)Tx = (yTx)O(zT 2x). 


An element e is said to be the unit element, or neutral element, under a binary 
operation T if for allz € X, eTx=2xTe= zx. E.g., the addition and multiplication 
operations on the set of real numbers are associative and commutative. Multiplica- 
tion is distributive with respect to addition. The unit elements under multiplication 
and addition are 1 and 0, respectively. 

The set of all permutations of the elements in {1,2,...,n} is closed under the 
above introduced operation of multiplication of permutations; the unit element of 


this multiplication is 
ra Hp dis Eiee cacti AY 
ue Gly eee 9 


This operation is associative, but not commutative. 


2.4. Functions. By function we mean a map into the real or complex numbers. 
We list the simplest combinatorial functions: 


factorial: if n is a natural number, then n! = n(n—1)...1; 0! = 1; if n and 
m are natural numbers, then 


Chee n(n—1)...(n—m+1), m<n, 
a 0, m>n; 


binomial coefficient: if n,k are integers, then 


n n!) 
= 0 < hen: 
(;) ml (n— ky! ae 
multinomial coefficient: if n = k; + kp +--- +k, where k,,ko,...,k; are 


integers, then 


n ni 
eS ———— 5 > 0 
oe ky! kel... kil? ki 2 


22 1. SOME INFORMATION FROM COMBINATORICS 


(this is also called the polynomial coefficient); 
entier and fractional part: if xz is a real number, then we denote by [z] its 


entier, i.e. the largest integer not exceeding 7; e.g., [5.3] = 5, [—5.3] = —6. 
So, x is an integer if and only if x = [z]. We denote by |z| the smallest integer 
not less than 2; e.g., ]5.3[ = 6, ]—5.3[ = —5. We have |z[ = —[—z]. The 


fractional part of x is the number {x} = x — [az]; e.g., {7} =0, {2.6} = 0.6, 
{—4.75} = 0.25. Sometimes we will use another notation: |x] = [z] (‘floor’) 
and [x] = |2| (‘ceiling’); 

indicator function: 


x {assertion} = 


1 if the assertion is true, 
0 if the assertion is false. 


2.5. A relation is a correspondence between identical sets. A two-place relation 
is also said to be binary. Examples of and means for specifying a binary relation are 
given in Figure 1.3. 

We distinguish the following properties of a binary relation R C X®) ona set X: 

reflexivity: «Rx for all x € X; 

anti-reflexivity: xRz for all x € X; 

symmetry: «Ry implies yRz for all x,y € X; 

anti-symmetry: «Ry and yRz imply x = y for all z,y € X; 

transitivity: «Ry and yRz imply xRz for all xz, y, z € X; ‘un serviteur d’un 
serviteur de moi ne pas un serviteur de moi’ is an example of an intransitive 
relation; 

dichotomy: for any z,y € X, either zRy or yRa.! 

Relations emerge often in practice. E.g., acquaintance of people is reflexive and 
symmetric, but not always transitive. Every hierarchy is also a binary relation, so 
that various ordered sets may be conveniently characterized by binary relations. 


2.6. An ordered set is a pair (X,R), where X is a set and RF a binary relation, 
Rc X). If we have xRy for x,y € X, ie. (x,y) € R, then we conveniently 
interpret this as x ‘being larger’ than y in the sense of the relation R. If neither 
xRy nor yRx, then x and y are incomparable elements of (X,R). We consider the 
following basic types of ordered sets: 

totally unordered: a pair (X,R) with R = 9; 

totally ordered: a pair (X, R) with R reflexive, anti-symmetric, transitive, 
and total. An example of such a set are the natural numbers, ordered by 
magnitude, i.e. by the relation <; 

partially ordered: an ordered set (X, R) with R reflexive, anti-symmetric, 
and transitive. 

In ordered sets we can conveniently distinguish specific individual elements and 
subsets; we give some of these. 

largest in (X, R): an element w € X such that for all x € X we have wRz, 
i.e. w is ‘larger’ than every element of X. Largest elements are sometimes 
simply called units, and are denoted by 1. 


1 R is then also said to be total or complete. 
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4 


FIGURE 1.4 


JO 5 
FIGURE 1.5 

maximal in (X,R): an element w € X such that X does not contain an 
element x satisfying Rw, i.e. X does not contain an element ‘larger’ than 
w. In other words, every x € X is either incomparable with w or is ‘smaller’ 
than w. 

least in (X, A): an element v € X such that for all e € X we have xRv, 
i.e. v is ‘smaller’ than every element of X. Least elements are sometimes 
simply called zeros, and are denoted by 0. 

minimal in (X, R): an element v € X such that X does not contain an 
element x satisfying vRzx, i.e. X does not contain an element ‘smaller’ than 
v. In other words, every x € X is either incomparable with v or is ‘larger’ 
than v. 

We say that an element x covers an element y if xRy and there is no z € X, 
distinct from x and y, such that xRz and zRy. Atoms are elements covering 0, 
and co-atoms are elements covered by 1. We say that two elements x and y are 
incomparable if (x,y) ¢ R and (y,x) ¢ R, i.e. neither zx is ‘larger’ than y, nor y is 
‘larger’ than z. 

For x,y € X, the interval [x,y] is the subset [x,y] = {z: z € X, yRz, zRr}, ie. 
this interval is the set of elements that are at the same time ‘smaller’ than y and 
‘larger’ than x. An ordered set (X, R) is said to be locally finite if every interval in 
it is finite. 

A chain in an ordered set (X, R) is a sequence of elements a), @2,...,@x,-.. in X 
such that a; Rag, ao Ra3,...,ap—1Raz,.... 

The length of a finite chain is the number of its terms minus 1. 

An antichain is a subset of an ordered set consisting of pairwise incomparable 
elements only. 

An ordered set (X, R) can be conveniently depicted in graphical form: the set X 
gives the points, and the relation R the oriented lines, directed from zx to y if xRy. 
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Q) 
PY 


FIGURE 1.6 


p 2, as 


FIGURE 1.7 


The graph thus constructed uniquely determines (X, R); e.g., the graph in Figure 1.4 
determines a binary relation on X'?), where X = {1, 2,3, 4,5}. In a number of cases 
this graph can be simplified. E.g., if it is known that (X, R) is reflexive, then the arc 
from x to x may be omitted. Similarly, for a symmetric R the graph can be drawn 
undirected. E.g., the graph of the relation Z = {x + y are prime numbers} C X), 
which has already been considered above, where X = {1,2,3,4,5}, is drawn in 
Figure 1.5. 

For depicting an ordered set we use the Hasse diagram: a graph in which the 
edge (x,y) is present if and only if y covers x. Since R is transitive, in this graph 
the condition (x,y) € R gives a path from y to x. To orient the edges of the 
Hasse diagram it is convenient to draw minimal elements in its lower part, and 
maximal elements in its upper part. E.g., in Figure 1.6 we have drawn a partially 
ordered set with two minimal elements (3, 4) and one largest (1). More precisely, this 
graph represents the following partial order: R = {(1, 1), (2, 2), (3,3), (4, 4), (5,5), 
(1, 2), (1,3), (1, 4), (1, 5), (2,3), (2, 4), (5,3)} CX, where X = {1,2,3,4,5}. Hence 
the Hasse diagram is obtained by removing the edges (1,3) and (1, 4) and all loops. 


We give the basic combinatorial examples of ordered sets. 


Boolean, ordered by inclusion of subsets: We consider the Boolean of 
Sn = {@1,@2,...,an}, P(Sp) = URpC*(Sn), and introduce a binary relation 
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FIGURE 1.8 


C on it according to the rule: For X,Y € P(S,) we have X C Y if and only 
if X is a subset of Y. This binary relation is a partial order relation. 
There is a least element in the Boolean, i.e. a subset of all other elements 
of the Boolean. Clearly, such is the empty set §. There is also a largest 
element, i.e. containing all other elements of the Boolean. Clearly, this is 
the set S, itself. Obviously, elements of the Boolean contained in a single 
C*(S,) cannot be included into each other as subsets, so they form an 
antichain. The interval [X,Y] in the Boolean consists of the S C S, for 
which X C S C Y; hence 


PALS fas i b ae, he ah oe 
0 otherwise. 


LX, ¥]| 


In Figure 1.7 we have drawn the Hasse diagram for the Boolean and n = 3. 
The horizontal levels of this diagram consist of antichains, more precisely, 
of C*(S,); in general, the levels of the Hasse diagram are determined as 
the subsets whose elements have shortest paths of identical lengths to the 
minimal element. 

Natural numbers, ordered by divisibility: We consider on the set N = 
{1,2,...} of natural numbers the binary relation | defined according to the 
rule: for z,y € N, x | y = x divides y without remainder. 

This binary relation is a partial order, with least element 1 € N, since every 
integer can be divided by 1 without remainder. It is obvious that the set 
of prime numbers is an antichain under this relation, and that all prime 
numbers are atoms. If x | y, then |[z,y]| is the number of divisors of y that 
are multiples of x, and although N itself is infinite, (N, | ) is locally finite. 


It turns out that the partial orders C and | are tightly related to each other. To 
investigate this interaction, we first consider the Hasse diagram of (N, | ), depicted 
in Figure 1.8. This diagram makes clear that the least element is 1, that the atoms 
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(@,bc) (6,a¢) 0 (c,ab) 


(2,6,c) 
FIGURE 1.9 

are the prime numbers, that the next level consists of the numbers representable 
as a product of two prime numbers, that the next level consists of the numbers 
representable as a product of three prime numbers, etc. Considering the subset 
{1,2,3,5,6,10,15,30} as being ordered by divisibility |, we become convinced of 
the fact that its Hasse diagram coincides precisely with the Hasse diagram of the 
Boolean (P(S3), © ). 

An even tighter relation with numbers ordered by divisibility is possessed by the 


Boolean of a multiset, ordered by inclusion: Let A be the finite multi- 
set with basis S(A) = {a1,@2,...,@n} and first specification [k1,k2,..., k,l, 
i.e. the element a; occurs k; times in A, 7 = 1,2,...,r. Then A has cardi- 
nality |A| = “i, ki = n, so that 


A= S>C(a)= > kala)C(a) = 32 C (a). 


acA a€S(A) 


We introduce on the Boolean P(A) a binary relation C according to the rule: 
for elements X,Y € P(A), X CY «- X is a submultiset of the multiset Y. 
The relation of this Boolean with numbers ordered by divisibility is based 
on the following simple fact. If p1,...,p, are distinct prime numbers and 
M = pips? ... p¥, then the partially ordered (by divisibility) set of natural 
divisors of M has the same Hasse diagram as P(A). 

Bellean, ordered by subpartitioning: The above introduced procedure 
of splitting a partition of a set determines a partial order relation on the 
Bellean. We say that a partition is ‘larger’ than another if the second can be 
obtained from the first by splitting. We denote this relation by the sign —. 
E.g., if X = {a,b,c,d}, then (ab, cd) — (a,b, c,d), since {a,b} = {a} U {b}, 
but (ac, bd) & (a, b, cd), since the union of the blocks of the second partition 
cannot be obtained from blocks of the first. This partially ordered set 
also has least and largest elements: (a,b,c,d) and (abcd), respectively. For 
the three-element set {a,b,c} the Hasse diagram of the Bellean ordered by 
splitting is depicted in Figure 1.9. 

The lexicographical order on sequences is given by the rule: (%1,%2,.--,%n,---) = 
(41, Y2,---5Yn,---) if and only if either x; = y; for all 2, or there is a natural number 
a €N for which x; > y% and xz; = y; for 7 <1. 

This definition is correct also for finite sequences of the same length. The lexico- 
graphical order is used for ordering words in dictionaries, with the condition that if 
a word is shorter than another, then the last, missing, letters of the shorter word 
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are weighted as maximal components; e.g., ‘an’ > ‘and’. In this way we can use this 
order also for sequences of distinct lengths. 

Difference, as well as equivalence of various objects, can also be conveniently 
characterized by binary relations. 


2.7. Equivalence relations. We say that an equivalence relation R is given on 
a set X if R C X®) and R is reflexive, symmetric, and transitive. An equiva- 
lence relation on a set X induces a partition of X; its blocks are called equivalence 
classes. K.g., equality (=) is an equivalence relation, and partitions each set into 
singleton (i.e. one-element) equivalence classes, and each multiset into equivalence 
classes consisting of copies of a single element, as many as the multiplicities of these 
elements in the multiset; thus, the volumes of the equivalence classes coincides in 
this case with the first specification of the multiset. For an arbitrary equivalence 
relation two elements in a single equivalence class are mutually equivalent (pair- 
wise comparable), and elements in different equivalence classes are not equivalent 
(incomparable). Hence, every partition of a set uniquely determines an equivalence 
relation on this set. 

The quotient set by a given equivalence relation R is the set of all equivalence 
classes of R. As an example, suppose an equivalence relation on the set of natural 
numbers is given by the rule: «Ry if x and y have the same parity. This gives 
two equivalence classes, of the odd and the even numbers, so that the quotient set 
consists of two elements. 


3. Binary functions on ordered sets* 


In this Section we let (P,<) be an ordered set (§2.6), and < a reflexive binary 
relation (§2.5), also called an order. A closed chain (§2.6) in P is called a cycle. A 
chain is called an f-chain if it contains either a zero or a unit (regarded as an ordered 
set). The ordered set (P, <) is called acyclic if it has no cycles, locally acyclic if all 
(closed) intervals in it are acyclic, and f-acyclic if it has no f-chains with cycles. 


3.1. Binary functions on (P,<). We consider real-valued functions only. Let 
A(P) be the set of such functions of two variables, defined on P?. If a function 
f € A(P) is associated with the binary relation < on P, then f will be called a 
binary function. The simplest examples of binary functions are the Kronecker delta 


6(x,y) = X{z=y} Va,y € P, 
the order function 
ChE, y) =e X{x<y} Vx, Yy S sé 
also called the zeta function of the set (P,<), and the strict order function 
n(x, y) = C(x, y) _ d(x, y) 


(here, x denotes the indicator function of the set occurring as its subscript). 
The set AJ(P) = Ac(P,<) of all binary functions on (P,<) is defined as the 
subset of functions in A(P) that admit a representation 


flay) = f(ye(a,y) Vay P, (1.1) 


1* This Section is adapted from [74]. 
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i.e. f assumes nonzero values only for pairs x < y or on closed intervals (cf. 82.6), 
and is zero otherwise. It is easy to see that the functions 6, ¢, and 7 satisfy (1.1). 

Note that if in (1.1) the function ¢ is replaced by a fixed function A(z, y) for which 
A(x,x) = 1 for all x € P, ie. if one considers the set of functions A,(P,<) C A(P) 
that are nonzero only on the pairs x,y for which A(z, y) = 1, then this is equivalent 
to considering the set A/(P) relative to the following, new, order < on P: 


on eS NG) oe 


Thus, Aj(P, <) 7 Ac(P, 3). 

Generally speaking, we need not require the order to be reflexive, and dropping 
this requirement does not lead to serious changes of the following results, although 
its satisfaction enables us to avoid at certain places awkward calculations. This is 
related to the fact that there are in fact two binary relations defined on P, the order 
< and the equivalence relation of ‘equality’ (=), while the Kronecker delta, which 
plays a major role in the following results, is defined as a function of the second 
relation only (see [13], [57], [94], [179], [194]). 

3.2. Operations on functions. In A(P) we first introduce two simple opera- 
tions: 

scalar multiplication: if a € R! and f € A(P), then a: f(x,y) = af (x,y). 
addition: if f,g € A(P), then (f + g)(x,y) = f(x,y) + g(x,y). 
It is easy to see that A(P) is closed under these operations. For a locally finite P 


we introduce in A/(P) the following general operation, whose modifications will be 
used throughout this Section. 


K-convolution: if f,g € Al(P), then 
fxg= 0 f(x, z)K(x,z,y)g9(z,y), 
zEP 


where K(x, 2z,y) is a function (from P? into R), called the kernel of the 
convolution. 


Since f,g € Al(P), we have 
f«g(z,y) =D f(x, z)¢(2, z)K (ax, 2, y)C(z, y)9(z,y) = 
a Ss, f(x, 2z)K(z,z,y)g(z, y)- 


Since P is locally finite, this shows that convolution is well defined. A kernel is 
called correct if if it admits a representation 


K(x, zy y) i Ce, z) K(x, ZUG; y) Vxr,Yy,2 € P, 


i.e. if K takes nonzero values on chains x < z < y only.? The convolution operation 
allows us to introduce the concepts of unit and inverse function. Namely, a (right) 
unit function with respect to the K-convolution is a function e € A/(P) that satisfies 


fre(z,y)=flz,y) Vaye P, 
for every function f € AI(P). 


It is obvious that a correct. kernel makes it possible to introduce the operation * in all of A(P). 
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inverse function: if f € Al(P), then a (right) inverse function of f is a 
function g = f~' satisfying 


ff * g(x,y) = e(x,y) VEU er, 


where e is some unit function. In particular, if the unit function is taken 
to mean the delta function, then a function inverse to the zeta function is 
called a Mobius function, denoted by p(x, y) (see [57], [94], [179], [194]). 


3.3. Stable kernels. It is obvious that AJ(P) is not always closed under the 
operations of convolution and taking the inverse. We now try to choose convolutions 
for which 


a) AI(P) is closed under convolution; 
b) the Kronecker delta is a unit function; 
c) there is a Mobius function. 


Convolutions (and their kernels) satisfying a)-c) are called stable on P. If there 
is a Mobius function belonging to AJ(P), such convolutions (and their kernels) are 
called Mobius stable on P. 

A criterion for a) to hold is given by 


PROPOSITION 1.1. The set AI(P) is closed under K-convolution if and only if 
K(x,z,y) = K(x,z,y)C(z,y) = Va,z,yeP, ze [z,y}. (1.2) 
PROOF. Sufficiency. We have 


feg(x,y)= >> f(z,z)K(2,z,y)g(z,y) = 


rSzsy 


- > FEA 2Ca yg) = 


w<zsy 


C(x, y) >> f(z, z)K (x, z,y)9(z,y) = Ca, y) f * g(a, y)- 


Necessity. Assuming that AJ(P) is closed, we find that if 


K (Zo, 20, yo) # K (Xo, 2, Yo) (Xo, Yo), Zo € (Zo, yo], 


then K(xo,2,yo) # 0 and 1 ¥ C(zxo0, yo), ie. X £ yo. But then the functions 
fo(x,y) = 6(x, 20) 6(y, 20) and go(x,y) = 6(x, 2)6(y, yo) on the one hand belong to 
AI(P), and on the other hand their convolution is nonzero at (xo, yo), since 


fo " Jo(Xo, yo) = S_ 6(xo, £0 )6(z, 29) K (0, 2, yo) 6(z, 20)6 (Yo, yo) == 
= \~6(z, %)K (x0, z, yo) = K (20, 20, yo) # 0. 


This contradicts the fact that AI(P) is closed. O 


COROLLARY 1.1. /f the order is also transitive, the set AI(P) is closed under any 
correct convolution. 
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PROOF. In fact, starting from the correctness of the kernel, the transitivity implies 
that 
Since 1 > C(x, y) we have 
K(x,z,y) 2 K(x, z,y)C(z,y), 


which means that 
K(x, z,y) = K(x, z,y)C(x, y). Lj 


Condition b) imposes quite severe restrictions on the nature of the kernel. 


PROPOSITION 1.2. The delta function is a unit function under K -convolution with 
kernel K(x, z,y) if and only if 


Keg) = 1 Va,yeP, xr<y. (1.3) 
PROOF. Sufficiency. If f € Al(P), then 
f *6(z,y) = >° f(a, z) K(x, 2, y)6(z,y) = 


= C(z,y)K (x,y, y) f(z, y) = C(x, y) f(x,y) = F(z, y)- 


Necessity. Assume that 6 is a unit function. If K(x, yo, yo) # 1 for xo < yo, then 
fo(z,y) = 6(x,x0)6(y, yo) belongs, on the one hand, to A/(P), and on the other 
hand its convolution with the delta function is not equal to it at (xo, yo), since 


f * 5( (0, yo) ee Lo, X0)6 (ea yo) K (Xo, 2, yo)6(z, Yo) = 
= S> 4( 2, yo) K (Xo, Z, yo) = K (xo, yo, y)) #- l= fo(xo, yo), 


contradicting the assumption. 0 

In a completely similar way can we demonstrate that delta is a left unit if and 
only if 

PAGE) Sd Ve,yeP, «x<y. (1.4) 
PROPOSITION 1.3. The operation of K-convolution is associative if and only if 
K(z,v,y)K (vu, z,y) = K(a,z,y)K (2, v, 2), GLU Soy. 
PROOF. Sufficiency. Let f,g,h be binary functions. Then 
f *x (9*K h)(z,y) = 
=> Dd f(z, v)9(v, z)h(z,y)K (x, 0, y)K(v, 2,y) = 


vEP zEP 


df (a, v)g(v, 2)h(z,y)K (x, v,y)K(v, 2,9) = 


ESvSzSy 


f(z, v)gv, z)h(z, y)K (x, z,y)K (2,0, 2) = 


asuczsy 


dD f(z, v)g(v, z)h(z, y)K (a, z,y)K (2, v, 2) = 


veP zEP 


| 


I 


rf 


= (f *x 9g) *K h(z,y). 
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Necessity. If for xo < vo < 2 < yo the condition of the proposition does not hold, 
then for the binary functions fo(z,y) = 6(2, 20)6(y, vo), go(x,y) = 6(2x, vo) 6(y, 20), 
ho(x, y) = 6(x, z0)6(y, yo) we have 


fo *K (90 *K ho)(Z0, Yo) = K (Xo, Vo, Yo) K (vo, 20, Yo) F 

# K (Zo, 20, yo) K (Zo, Vo, 20) = (fo *x Jo) *k ho(Xo, Yo): 
This contradicts the associativity of the kernel. O 
In particular, for concrete kernels this criterion takes a lattice character. 


COROLLARY 1.2. /f R is reflexive and transitive, then the set of binary functions 
is closed under any correct convolution. 


COROLLARY 1.3 ({134]). Jf P is finite, then AI1(P, R) is a subalgebra of the al- 
gebra Al(P) if and only if R is transitive. 

COROLLARY 1.4. Jf R is reflexive and f is acyclic, then the set AIC(P, R) is an 
algebra with unit 6 (in general, not associative). 


COROLLARY 1.5. [f R is reflexive and f is acyclic, then the algebra AIC(P, R) 
is associative if and only if every f-chain is linearly ordered in (P, R). 


Sufficient conditions for Mobius stability in the class of sets with a reflexive anti- 
symmetric order are given by the following lemma. 


LEMMA 1.1 (INVERSE FUNCTION LEMMA). Suppose (P,<) is a locally finite 
f-acyclic ordered set with a reflerive antisymmetric order. Assume that the ker- 
nel satisfies the conditions (1.2) and (1.3), and that 


K(z,z,y)#90 Va,yeP, xr<y. (1.5) 


Then a function f € AI(P) has an inverse function f~' € AI(P) (relative to the 
delta function) if and only if f(x,x) #0 Vx € P. 


The proof repeats almost verbatim that of Lemma 2.2.1 in [94]. It is carried out 
by induction, and follows from the possibility of defining f~’ recursively as 


: 6(x,y) f(%, 2) K(x, 2,y) ,- 
f ‘Gos = K j *(z,y). 
f(x,y) rcz<y f(x; L) (x, x, y) 
zsy 
In particular, this implies f~’ € AI(P). Note that the assumptions of antisymmetry 
and acyclicity are essential in the induction argument. 


COROLLARY 1.6. Suppose the order is reflexive and antisymmetric, and P is f- 
acyclic. If the kernel K on P satisfies (1.2) and has delta as left and right unit, 
then f € AI(P) has an inverse if and only if f(x,x) #0 Va € P. 


In particular, K(x, z,y) = ¢(x,y) is Mobius stable on every f-acyclic ordered set 
with a reflexive and transitive order. One can prove that ¢ is Mobius stable even on 
an ordered set with a reflexive and transitive order. 

It is clear that, in general, the existence of a Mobius function is equivalent to 
solvability of the system of equations 


C * p(x, y) = 6(z,y), ia! aoa of 
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for the unknown yp. If P is finite, a precise criterion for the solvability of such a 
system can be expressed in terms of minors. 


3.4. New kernels on ordered sets. In this Subsection we assume that (P, <) 
is a set that is (at least) reflexive, transitive, and antisymmetrically ordered. 

Earlier papers (see, e.g., [57], [94], [179], [194]) considered mainly the unit kernel 
K = 1 on a partially ordered set. Of course, this kernel is Mobius stable on (P, < 
). Effective extensions of the set of kernels have only been made for sets (P, <) 
of a special kind (see, e.g., [202]). Our precise conditions for stability make it 
possible to introduce new kernels for general (P,<) (but still reflexive, transitive, 
and antisymmetric). We now consider several of these. 

Let K(x,z,y) = |[z,y|| be the cardinality of the interval [z,y]. Clearly, (1.2), 
(1.3), (1.5), hold for this kernel, so that it is Mobius stable. 

The Mobius function is defined by 


ulx,y) = (x,y) — >> esa zy). (1.6) 


xr<z<y | [a y] | 


In particular, if we consider the set of all subsets of a finite set S, = {a1,...,@n}, 
partially ordered by inclusion, then 


ly|—|x| 
eval = 2-Fc(z,y) and w(zyy)=(-5) my). 7 


Let r(x, y) denote the length of the largest chain with ‘lower end’ x and ‘upper 
end’ y. Then the following may serve as examples of Mobius-stable kernels on a 
partially ordered set: 


K(2,z,y) = 7r(z,y) + ¢(z,y); (1.8) 
K@zys=¢°", @eN; (1.9) 
_f lleull \. 
K(z,2,y) = He - i (1.10) 
I[x, y]| 
K s 
20 = (ea tall va) 


In particular, the enumeration and classification of stable (M6bius-stable) kernels 
makes it possible to count and classify ordinary combinatorial identities (and their 
inverses). 


3.5. The inversion principle. Suppose that there is defined of the set A/(P) 
of all binary functions a K-convolution with kernel K (zx, z, y). 


THEOREM 1.1. Let (P,<) be a locally finite, reflexively ordered set with zero Op. 
Suppose that we have functions f,g,A,« € A(P), defined on P?, satisfying 


g(y, x) = DU f(z, 2)K(z,y,2) Va,ye P, (1.12) 


z<y 


C*CX(z,y) = O(z,y)K(z,y) = Va, y © P. (1.13) 
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Then 
F(@2)a(@2) =">- oy, 2) AG 2) (1.14) 


ysxr 
PROOF. Since Op € P, the local finiteness of (P,<) shows that all sums below 
are well defined: 


SS oy, z)Ay, x) = >> (= [EDK aie ) Na) = 


ysry yszryv \z<y? 


= SV sz, 26, y) Kz, 9, 2)Ay,2) = 


ysr¥ 2 


= Ss; f(z,2) 2 C(z,y)K(z,y, x)C(y, x)A(y, 2) = 
= e2)\C#OAG 2) = 
= 3 f(za be a)K(2\ 2) =f (Gz) K ea), 


as required. 0 


It is clear that the existence of a zero can be replaced by the finiteness of sums of 
the types (1.12) and (1.14). 

If (P,<) is moreover antisymmetrically ordered, then «(z,x) = K(x,x,x)X(z, 2x), 
and if 6 is a unit function for K-convolution, then EPS AO): 

If « = 1, then ¢\ = yp, whereas if A € AI(P), then (A = A = wp. If K isa 
Mobius-stable kernel, then for « = 1 we can always choose a \ € AI(P) satisfying 
(1.13); in this case the requirement of Mébius stability may be replaced by that of 
correctness and stability of the kernel K. Note also that for « = 1 and (P,<) at 
least antisymmetrically ordered there is a reflexive, antisymmetrically ordered set 
for which (1.13) holds for K = 1 and ‘ordinary’ p: 


ds LY, 
u(x, y) i > epey tlZ,y)s L< Y, (1.15) 
0, hay Sep 


An example of such a reflexive, antisymmetrically ordered set without cycles is 
shown in Figure 3.5 (where an arrow means that strict inequality < holds). Clearly, 
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this example is not transitive, but a simple verification shows that (1.13) holds. By 
adding an arrow from a to 6 we obtain a set with cycles which still satisfies (1.13). 

Finally, if we take the order antisymmetric and transitive, K = 1, « = 1, and 
f(z,x) = f(z), then \ = p € AI(P) and Theorem 1.1 becomes the ordinary Mébius 
inversion principle for partially ordered sets (see [57], [94], [179], [194]). 


3.6. Calculation of » for antisymmetric orders. Here (P,<) will be taken 
locally finite, f-acyclic, and reflexively and antisymmetrically ordered. On Al(P) we 
define the convolution with kernel K(x, z,y) = ¢(x,y). Then Al(P), endowed with 
the operations of addition, scalar multiplication, and ¢-convolution, is an algebra, 
with unit element 6. 

Let 7(x,k,y) be the number of chains of length k with initial element y and 
zero element x in the set (P,<). A method for computing the values of a Mébius 
function, not relying on recursion, is given by the following theorem. 


THEOREM 1.2. Let (P, <) be a locally finite, f-acyclic ordered set with a reflexive 


and antisymmetric order. Then 


w(x, y) = -(-1)* r(x, ky) = Va, y € P. (1.16) 


k>0 
PROOF. Introduce the notation 


FO = (fe fe flea ff 


where f®) = 6. Since A/(P) is an algebra, it immediately follows that 
p= CT = 6+) = 6-9) 49% -9 +..., 
1.e. 
(x,y) = do(-1)*n(a,y) Va, y € P. (1.17) 

k>0 

Since we can easily check that 
n(2,y) = To(a,k,y) (k= 0), 

we obtain the required result. 0 


Note that (1.16) gives only the value of a left Mobius function. The similar formula 
for a right Mobius function has the form 


U(x, y) = > (-1)*7(z, ky), (1.16’) 


k>0 


where 7;(x,k, y) is the number of chains in P of length k with largest element y and 
terminal element x. 

In particular, if the order is transitive and antisymmetric, then 7 = 7, and 
formulas (1.16) and (1.16) coincide and give Ph. Hall’s theorem (see [13], [179], 
[194], [203]). 

Furthermore, even if the ¢-kernel is Mébius stable on a locally finite antisym- 
metrically ordered set (P,<) (which need not be f-acyclic, as above), then the 
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requirement of f-acyclicity in Theorem 1.2 is essential, since the presence of a cy- 
cle in an f-chain would imply that there are infinitely many summands in (1.16), 
resulting, in particular, in equalities of the type 


Zs y(-1. (1.18) 


k=0 


In fact, it suffices to compute the Mobius function p(z, y) of the binary relation 


in explicit form, and to use the above-given analog of Ph. Hall’s theorem, noting 
that there is a unique chain of length k from y to x (for each k > 1), and that 
(x,y) = —1/2. 

At first glance the algebras introduced in this Section may seem utmost abstract 
constructs. It turns out that they are far from this, and Stanley’s phenomenon may 
serve as a first substantiation of this. Nowadays there is a whole theme with origin 
given by Stanley’s theorem, which asserts that locally finite partially ordered sets are 
isomorphic if and only if their incidence algebras are isomorphic (as abstract alge- 
bras). Thus, the structure of the basement (a set with a binary relation) is uniquely 
determined by the corresponding algebra of functions. And this phenomenon takes 
place for a large class of binary relations and algebras; the main results here are due 
to V. Shmatkov. 

Thus, the apparatus of incidence algebras may serve as a model for the actual 
phenomena that can be adequately described by sets with binary relations. A con- 
crete example is given by a combinatorial medium, including such in the form of an 
electronic network. 

Precisely the construction of algebras of functions over arbitrary binary relations 
crucially extends the sphere of modeling actual phenomena, which was earlier re- 
stricted by the axioms of partial order. 


4. Combinatorial schemes 


Combinatorial schemes are the most typical and in practice most often used com- 
binatorial formations. The use of some combinatorial scheme is determined by the 
initial statement of a problem and by the choice of a method for solving it. 

Among the simplest combinatorial schemes, the following two are used most often: 
combination and arrangement. Both these schemes are easy to comprehend by 
invoking yet another natural notion, that of selection, more precisely as an action 
of choosing some objects from a given collection. So, 
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r-combination from n elements: the name given to the result of choosing 
r elements from n elements, not taking into account their order; 

r-arrangement from n elements: the name given to the result of choosing 
r elements from n elements, taking into account their order. 


Since we choose from a set, all n initial elements are distinct, and an r-combination 
is an r-element subset, while for r = n an r-arrangement is a permutation of the 
initial elements. 


Next to the natural notion of selection, a systematization of the simplest combi- 
natorial schemes is also provided, and here we propose the 


listing scheme: The simplest combinatorial unification of certain objects is 
by listing them, i.e. representing these objects by symbols: elements of a list 
(usually written in row form (a1, @2,...,@n,...)). E.g., a set given as a list 
of its elements is such. The set consists of distinct elements, whose order is 
immaterial; combination by simple set attributes, including being different 
and the sequence order, is the methodological basis of combinatorics, and 
induces all the most simple combinatorial schemes for lists: 
lists of distinct elements, whose order is immaterial: sets, combinations. 
E.g., {1,2}, {1,3}, {2,3} are all 2-combinations of the set {1,2,3}. The 
number of all r-combinations from an n-element set equals the binomial 
coefficient (”) =ni/(ri(n—r)!). 
lists of not necessarily distinct elements, whose order is immaterial: 
multisets, combinations with repetitions, selection, totalities, families. E.g., 
{1,1}, {1,2}, {1,3}, {2,2}, {2,3}, {3,3} are all 2-combinations with repeti- 
tions from the three element set {1, 2,3} or, in other words, all two-element 
submultisets of the multiset {17, 22,37}. The number of all r-combinations 
with repetitions (without restrictions on the number of repetitions) from n 
elements is Cae 
lists of distinct elements, whose order is essential: arrangements, per- 
mutations. F.g., {1,2}, {1,3}, {2,3}, {2,1}, {3,1}, {3,2} are all 2-arrange- 
ments from the three element set {1,2,3}; while (1, 2, 3), (1,3, 2), (2, 3, 1), 
(3, 1,2), (3,1, 2) are all permutations of the set {1,2,3}. The number of all 
r-arrangements from n elements is r! (ae hence the number of permutations 
of an n-element set is n!. 
lists of not necessarily distinct elements, whose order is essential: ar- 
rangements with repetitions, permutations with repetitions, sequences, vec- 
tors, tuples. E.g., (1, Ne (1s 2), (2, 1); (1, 3), (3, 1), (2, 2), (3, 3), (2,3), (3, 2) 
are all 2-arrangements with repetitions from the three elements {1, 2,3}. 
The number of all r-arrangements with repetitions (without restrictions on 
the number of repetitions) from n elements is n”. 
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types of lists 
according to: 


being distinct | without 
repetitions 
being orderét 


k— 
= RR A) 


with arbitrary 
many repetitions 


not distinct 


eee ae |; 7 eens he distinct 
Table 1.1. 


This systematization allows us to arrange the simplest combinatorial schemes 
and their most important numerical characteristics as in Table 1.1. This table 
immediately shows the equivalence of certain combinatorial schemes: 


the listing scheme of k-elements from n objects; 
the scheme of arranging k pellets over n distinct boxes; 
the urn scheme of choosing k balls from n distinct urns. 


ae ee 
(a, bc) (bc, a) (a, cb) 
(cb, a) (b, ac)(ac 

(b, ca) (ca, b)(c, ab) 
(ab, c)(c, ba) (ba, c) 


distinct: ordered in 
boxes and blocks 


(a, bed) (bed, a)(b, acd) 
(acd, b)(c, abd) (abd, c) 
(d, abc) (abc, d)(ab, cd) 
(cd, ab) (ac, bd) (bd, ac) 
(ad, bc) (bc, ad) 
R=6 h=3 
(4,1,1)(1,4, (1, 1,4) 
(4, 9:1)(3, 1,2), 3,2) 
(2,3, 1)(2,1,3)(1, 2,3) 


(a, bcd) (ab, cd) 
(b, acd) (ac, bd) 
(c, abd) (ad, bc) 
(d, abc) 


distinct: unordered in 
boxes and blocks 


not distinct 


(25.2.2) 
es ae Sep 
Table 1.2 


The numerical values in the table entries denote the amounts of corresponding 
combinatorial formations. 
Table 1.2 illustrates the following schemes: 
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the scheme of arranging n particles over k boxes without empty boxes; 
the scheme of partitioning n objects into k blocks; 
the scheme of representing a natural number n as a sum of k natural numbers. 


So, these schemes (listing, selection, arrangement, partitioning, urn) have allowed 
us to systematize all elementary combinatorial formations. However, not for every 
problem do the existing combinatorial schemes ensure a unified solution, leaving thus 
unrealized the Leibniz dream of creating an all-embracing combinatorial scheme, 
although the phrase ‘general combinatorial scheme’ has already been ‘in use’. In 
essence, this combinatorial scheme is represented in Tables 1.1 and 1.2, with the 
single difference that the concrete contents of entries of the latter must be replaced 
by general ‘mechanisms’ modelling the initial combinatorial conditions and types, 
and by general means for calculating the necessary numerical characteristics (mainly 
by the method of generating functions or other enumeration methods). For a more 
detailed acquaintance with the general combinatorial scheme see |60], [62]. 

Thus, the notion of combinatorial scheme includes a practical, positive attempt 
to a unified approach to some or other circle of combinatorial problems. 


5. Combinatorial problems and their complexity 


With the appearance of powerful computers the role of mathematical disciplines 
in which discrete systems are studied has become essentially larger. Presently, the 
level of scientific-technological processes and the problems posed in practice have 
produced most profound demands on the methods of investigating such discrete 
systems as graphs, block-designs, matrices, electrical circuits, transportation flows, 
information flows, manufacture organizational systems, logical schemes, and many 
others. Despite the apparent variety and specific natures of these discrete systems, in 
their basics these systems have much in common. We can unify their mathematical 
models and the methods for solving them in individual types, and here we consider 
the main such types. 


5.1. Types of combinatorial problems. In the most general case the condi- 
tions of any problem are (the problem statement is) determined by a list of initial 
parameters or initial data and a description of the required result or of the properties 
the answer has to satisfy. 

Combinatorial problems are problems with discrete variables only. As a rule, 
the initial data of such problems are combinatorial objects: numbers, vectors, sets, 
graphs, families of sets, number partitions, etc. Depending on the description of 
the result looked for, the solutions of combinatorial problems may be either the 
answer ‘yes’ or ‘no’, or some object from a finite (infinite) set, i.e. a set of elements 
with given properties, and also graphs, numbers, partitions, etc. The differences in 
the descriptions of the required result of solutions combinatorial problems can be 
grouped together into individual types, characterized by common properties of the 
final results. Hére we will consider the basic types of combinatorial problems. 

Counting problems are combinatorial problems whose solution consists of the num- 
ber of elements of a set having some property, or a family of properties, given in the 
problem statement. We give some examples of such problem statements. 
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EXAMPLE 1.1. For a given partition (n1,...,m,) of a number n, determine the 
amount of partitions of a number k < n packed in it. 


EXAMPLE 1.2. For a given graph G, determine the number of simple chains of 
length three in it. 


EXAMPLE 1.3. In how many ways can we distribute n elements over n boxes (one 
in each box) such that no ith element falls into box i? 


EXAMPLE 1.4. In how many ways can we make an unordered selection of 7 balls 
from an urn with 11 distinct balls, without replacement of balls (with replacement)? 


Enumeration problems are combinatorial problems in which the solutions are ob- 
jects with given properties (numbers, sets, graphs, number partitions, partitions of 
sets, etc.). The following are examples of such problem statements. 


EXAMPLE 1.5. List the partitions of a number k that are packed in a partition 
(n1,...,Mr) of anumber n> k. 


EXAMPLE 1.6. List all paths of a graph G whose lengths do not exceed a given 
length. 


EXAMPLE 1.7. List the permutations of the five letters A, B,C, D, FE in which no 
two consonants are placed next to each other. 


We note that in certain handbooks enumeration problems are defined as counting 
problems. However, for such a definition the problem of finding the whole set of 
solutions is not singled out as an individual type. Therefore we use the solution 
proposed in [52], in which problems whose required result is the whole set of solutions 
are singled out as an individual type of enumeration problem, while finding the 
amount of admissible solutions is defined as being a counting problem. 

Recognition problems are combinatorial problems with as solution the answer ‘yes’ 
or ‘no’. This type of problem is fundamental in our investigation, hence we consider 
in more detail some examples of stating such problems. Presently, intensive studies 
are devoted to such problems, and many such problems have been given a proper 
name, characterizing the required solution result. 


EXAMPLE 1.8 (PACKABILITY OF PARTITIONS PROBLEM). For a given pair of par- 
titions of numbers k and n, determine whether the partition (ki,...,k;) is packed 
het Cera cr 


EXAMPLE 1.9 (HAMILTONIAN PATH). For a given directed graph G = (V, A), 
determine whether G has a Hamiltonian path, i.e. a directed path passing through 
each vertex in V exactly once. 


EXAMPLE 1.10 (GRAPH COLORING). For a given graph G = (V, £) and integer 
k, determine whether there is a map ¢: V — {1,...,k} such that [v, u] € E implies 


bv) # Hu). 


EXAMPLE 1.11 (PATH IN A DIRECTED GRAPH). For a given graph G = (V, A) 
and subsets S,T € V, determine whether G contains a path from some vertex in S 
to a vertex in T. 
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EXAMPLE 1.12 (MAXIMAL PAIRING). For a given graph G = (V, FE) and integer 
k,, determine whether G contains a set M of k edges such that no two edges in M 
have a common vertex in M. 


EXAMPLE 1.13 (GRAPH CLIQUE). For a graph G = (V, E) and integer k, deter- 
mine whether G contains a clique of dimension k, i.e. a totally connected subset of 


V. 


EXAMPLE 1.14 (TRAVELLING SALESMAN PROBLEM). For a finite set Co = 
(C1,...,Cm) of ‘cities’, given ‘distances’ d(c;,c;) € Zt between each pair c;,c; € C 
and given integer B € Zt, determine whether there is a ‘route’ passing through 
all cities in C and whose length does not exceed B. Here Z* is the set of positive 
integers. 


EXAMPLE 1.15 (INTEGER LINEAR PROGRAMMING PROBLEM). For an_ integral 
(m+n) matrix A and an integral m-vector b, determine whether there is an integral 
n-vector x such that Ax = b, x > 0. 


EXAMPLE 1.16 (3-DIMENSIONAL COMBINATION). For three given sets U, V,W of 
the same cardinality and a given subset T of U-V-W, determine whether T contains 
a subset M such that |M| = |U| and such that if (u,v, w) and (u’,v’, w’) are different 
triplets in M, thenu4wu,vf7v,wFf wu". 


EXAMPLE 1.17 (INTEGRAL KNAPSACK). For given integers cj, 7 = 1,...,n, and 
k, determine whether there are integers x; € {0,1}, 7 = 1,...,n, such that 
i Cala =k: 

EXAMPLE 1.18 (PARTITIONING). For given numbers @1,...,@n, does there exists 


a subset S C {1,...,n} such that Dyes a; = Dyes aj? 


EXAMPLE 1.19 (MAXIMAL GRAPH CUTTING). For a given graph G = (V, £) and 
integer k, determine whether there is a partition of V into subsets V\,V2, Vi 4 9, 
1 = 1,2, such that F contains at most k edges joining V, with V2. 


EXAMPLE 1.20 (SCHEDULING IN A MULTIPROCESSING SYSTEM). For a given set 
A of ‘data’, with execution lengths l(a) € Zt, with number of processors m € Zt, 
and with discrete list D € Zt of execution of these data, determine whether there 
exists a partition A = A, U---UA,, into disjoint subsets such that 

ae ee a ae BS 
max{) /I(a) Le <n} =D 

EXAMPLE 1.21 (VERTEX COVERING). For a given graph G = (V, E£) and integer 
k <|V|, determine whether G has a vertex cover of at most k elements, i.e. a subset 
V' CV such that |V’| < k and for each edge {u,v} € E at least one of u,v belongs 
to V’. 


Mastering problems are combinatorial problems for which the final result of solving 
them is either an admissible solution, or the answer ‘no’. In fact, a number of 
combinatorial problems do not have one solution, but a set of solutions, for one and 
the same initial data. For example, the problem of determining (constructing) a 
Hamiltonian cycle has as many solutions as there are Hamiltonian cycles in the graph 
under investigation. The finding of one of them is a result of solving a mastering 
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problem, while if no one is found (the graph does not have such a cycle), the result 
is ‘no’. In other words, when stating the required result, mastering problems do 
not have the restrictions in the conditions which we would have when searching 
for a single unique solution in a set of solutions. Hence, the mastering problem of 
packability of number partitions can be formulated as follows: 


For a given partition (n;,...,n,), find a partition of rank ¢ of a number 

k <n packed in this partition. 

It is not especially difficult to formulate mastering problems on the basis of the 
Examples !.8-1.20 as a type of recognition problems, giving restrictions on the 
properties of the object looked for. 

Combinatorial optimization problems are extremal combinatorial problems whose 
solution is an object with concrete properties, given in the conditions of the problem. 
Usually such problems are given as restrictions, and the solution of the problem 
requires one to consider the whole subset of admissible solutions. The aim of solving 
all imaginable combinatorial optimization problems is the construction of optimal 
objects, such as circuits, routes, vertex sets, partitions and lists of integers, etc. 
Examples of extremal combinatorial problems are problems in which we are required 
to find a maximum (minimum) of a function F’ defined on a set of systems Q = 
(71,.-.,%n). Such systems may be partitions, combinations, various subsequences, 
etc. Usually, when solving such problems in general it turns out to be difficult to 
evaluate the relative magnitudes of F'(z,) and F'(x,) without directly calculating 
them, while the determination of F'(z) for a concrete 7 sometimes requires the 
solution of another combinatorial problem. 

We consider a version of the optimization problem of packability of number par- 
titions: 

For given partitions (n,,...,n,) and (ki,...,k,), determine a version 

of packing (k,,...,k;,) into (nj,...,m,) such that, ifn > k, the rank 

of the partition of the remainder is minimal. 

As for mastering problems, optimization problems can be formulated on the basis 
of recognition problems. 

Obtaining recognition problems from optimization problems, and vice versa, is 
an important tool. The procedure of this reformulation is not difficult. In fact, 
the problem in Example 1.14 on the existence of a ‘route’ whose length does not 
exceed an integer B can be made an extremal combinatorial problem. For this it 
suffices to remove the restriction imposed by B, and formulate the result of solving 
the problem as: 

Find a ‘route’ of minimal length which passes through all cities in C. 
Exhausting information, including a couple of hundreds of similar problem state- 
ments, can be found in [21], [50]. 

So, we have given definitions and examples of the statement of combinatorial prob- 
lems of five types: counting, enumerating, recognition, mastering, and optimization. 
From the viewpoint of a most general approach to the elaboration of methods for 
solving them, at first glance it may seem inconvenient to single out some of these 
as individual types, since the methods for solving them are the same. In fact, for 
solving the counting problem of Example 1.1 we can use the method for solving the 
enumeration problem in Example 1.5, i.e. enumerate all partitions of k < n packed 
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in (n,,...,m,) and count their number. However, we use such a solution method 
only because we do not know a polynomial or analytical expression allowing us to 
compute the required quantity (the number of packed partitions), not applying the 
method of mastering. 

We note that also the solution of the enumeration problem on packability of 
partitions encounters great difficulties. This can be explained by the fact that we 
do not know an efficient method for solving the recognition problem on packability 
of partitions, as will be shown in the sequel. But not all combinatorial problems or, 
rather, methods for their solution can be characterized by such a state of affairs. 
A set of methods for solving counting and enumeration combinatorial problems has 
been developed; it can be found in [51], [52]. 

The aim of the present investigation is to develop methods for solving combi- 
natorial recognition problems, which are at the basis of the central set-theoretical 
problem of all of discrete mathematics: can we exclude mastering when solving dis- 
crete problems? In other words, the discussion is about the principal possibility of 
finding a required solution (e.g., an optimal one), without running over all or almost 
all possible versions of solving the problem. This problem is not only of mathemat- 
ical nature, it is also a cognitive problem. The cognitive side of the problem lies in 
distinguishing the class of problems for which there is no solution algorithm. As an 
example we may give Hilbert’s tenth problem: Does there exist an algorithm which 
recognizes, for a given polynomial p(x1,...,2%n) with integer coefficients, whether the 
equation p(21,...,%n) = 0 has a solution in integers. Yu.V. Matiyasevich proved 
in [49] that such an algorithm is principally impossible. Also, the existence has been 
proved of combinatorial problems for which the solution can only be obtained by 
exponential algorithms. 

Consequently, the solution of combinatorial problems of this type must be ‘care- 
fully’ approached, i.e. taking into account the possible absence of any algorithmic 
solution method at all, as well as the possibility of solving them by exponential 
algorithms only. 

The lasting attempt to develop a theory of combinatorial problems, and the prac- 
tice of solving such problems, led, in the 1970’s, to the sudden emergence of an 
interest in the analysis of the computational complexity of solving such problems. 


5.2. Complexity of combinatorial problems. First of all we give a defini- 
tion of complexity of a problem, and give some notions on which the analysis of 
complexity rests. 

An algorithm is a strict sequence of instructions (steps), making up a procedure 
for solving a problem posed. Without loss of generality in this definition, we consider 
an algorithm to be a computer program, written in a formal machine language. It 
is convenient to assume that an algorithm is a solution of the problem posed if it 
gives a valid answer on the whole domain of admissible values of the problem. For 
example, an algorithm A is not a solution of the travelling salesman problem if, as 
the result of its execution, it does not give a minimal route for at least one set of 
initial parameter specifications C,, d(c;,c;), B. 

The complexity of combinatorial problems is defined in terms of the complexity 
of the algorithms for solving them. In fact, it is intuitively clear that if for a given 
problem there is no algorithm known that would give a ‘fast’ exact answer, then 
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the problem can be regarded as complex. The measure of complexity of algorithms 
is taken to be their execution time, considered as a function of all the parameters 
figuring in the domain of applicability of the algorithm. 

The input dimension of a combinatorial problem is the number of symbols with 
which the input parameters of the problem can be coded. As a rule, the code of 
such parameters is defined in a pre-given fixed alphabet. For example, any input 
data of the combinatorial problem of recognizing packability of number partitions 
can be coded in the alphabet {/,0,1, 2,3, 4,5,6,7,8,9}. If we pose the problem of 
determining the packability of the partition (3, 2,2,1,1) in the partition (4, 2, 2,2), 
then the following sequence of symbols is its input: 4/2/2/2//3/2/2/1/1. The input 
dimension of this problem is 24. 

The time complexity of an algorithm is a quantity reflecting the time spenditure 
in executing the algorithm with a given input dimension. It can be computed as a 
function which assigns to every concrete input dimension the quantity that is the 
maximal time of executing the algorithm for various values of the parameters in 
initial data of this dimension. It is clear that the execution time of an algorithm 
depends not only on the method of solution embodied in it. It will also depend on 
the type of computer on which it is implemented, and also on its implementation. To 
exclude such indefiniteness when estimating the time complexity of algorithms, the 
algorithm is expressed by a number of elementary steps (operations), to be executed 
by a hypothetical computer. Here it is assumed that the time of completing any 
elementary step is equal to the unit. Thus, the time complexity of an algorithm is 
defined as the maximal number of elementary operations needed for executing the 
algorithm when solving a problem with fixed input dimension on the set of possible 
concrete input data for this problem. 

Presently there are two levels of complexity of algorithms. In their definition, we 
write that a function f(x) = O(g(2)) if there is a constant C’ such that for sufficiently 
large x the inequality f(x) < Cg(x) holds. Here, f(x) and g(x) are functions defined 
on the set of positive integers and taking positive real values. 

A polynomial algorithm is an algorithm whose time complexity is bounded above 
by a polynomial p(r), where r is the input dimension of the combinatorial problem. 
In other words, the complexity of such an algorithm grows polynomially with the 
input dimension of the problem, and the rate of growth is defined as the degree of 
this polynomial. 

An exponential algorithm is an algorithm for which there is an exponential lower 
bound for its time complexity. 

It is clear that, when solving a single problem, various algorithms will have dif- 
ferent time complexities. The choice of this parameter as determining factor in 
deciding the efficiency of an algorithm is not accidental. In fact, the time of solv- 
ing extremal combinatorial problems turns out to be a critical parameter for the 
majority of processes and phenomena formalized by such problems. 

Thus, the complexity of a combinatorial problem is defined as the complexity of 
the algorithms for solving them. We will consider certain aspects of the efficiency 
of applying the above given types of algorithms from this point of view. 


5.3. Efficiency of algorithms for solving combinatorial problems. In the 
widest sense of the word, the notion of efficiency of an algorithm is related to the use 
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of all computational resources necessary for its computer implementation. However, 
when comparing the efficiencies of algorithms it is usual to take their working time 
as basis. Such a comparison of algorithms can be done by different means or by 
various criteria. Two such means are encountered most often: according to the 
mean working times of the algorithms, or according to the duration of their working 
times. When comparing algorithms according to mean working time we take as 
measure the averaged value of this time on the whole domain of definition of the 
algorithm. When comparing by duration of working times, we evaluate the time at 
a point in the domain of definition at which the algorithm has to work longest of 
all. 


The choice of a comparison measure determines the practical use of the algorithm. 
Therefore, since time restrictions are often a dominating factor, characterizing the 
fitness of some algorithm in practice, when considering methods for solving combi- 
natorial problems our main attention will be concentrated on this kind of resource. 

The characteristics of the complexities of algorithms introduced above determine 
to some extend the efficiency of using the algorithms. In fact, polynomial algorithms 
are conveniently taken to be such for which the functions of the input dimension 
of the problem (r) are the functions r, rlogr, r?, r°, r°, while for exponential 
algorithms this function f(r) has, e.g., the form 27, 3", r'&", r!. It is usual to 
consider polynomial algorithms to be more efficient for practical purposes. This is 
supported by the following arguments. 


It is clear that with increasing input dimension of a problem, every polynomial 
algorithm for its solution is more efficient than every exponential algorithm. We 
prove this assertion using Table 1.3. In it we have given tentative times needed 
by polynomial and exponential algorithms (for various input dimensions of combi- 
natorial problems) on a hypothetical computer realizing one step of any algorithm 
in 1 microsec. The distinction between the two types of algorithms is especially 
clear when solving problems of large dimensions. We must note that the majority 
of exponential algorithms are simply variants of solution by the method of complete 
sorting out. 
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Table 1.3 


Another positive property of polynomial algorithms in comparison to exponential 
algorithms is their great ‘sensitivity’ to an increase in computer speed. Table 1.4 
testifies of this. In it we have indicated the dimensions of problems, for various 
complexity functions, which can be processed by a hypothetical computer with fast 
speed R = 1 op/sec during 1 hour of functioning, and which can be processed by 
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it if R is increased a 100 and a 1000 times. This table testifies to the fact that for 
polynomial algorithms an increase in computer speed gives a substantial increase in 
the possible dimensions of problems to be solved, while for exponential algorithms 
only additive increase in the possible dimensions of the problems occurs. 


function | computer with high R= 1000 


speed R= 1 
N, 
5.61 Np 
5.98 Ny 


Table 1.4 
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Yet another fact testifying to the efficiency of polynomial algorithms is the prop- 
erty that they are a closed class, i.e. the possibility of combining polynomial algo- 
rithms for solving partial cases of compound combinatorial problems—the combined 
algorithm is also polynomial. 

Despite these arguments in favor of the efficiency of using polynomial algorithms, 
it is necessary to study also the other side of characteristics of polynomial algorithms. 
Is an algorithm of complexity r®° a practical solution of a problem? Obviously not, 
since the time needed for executing it even for input dimension of the problem r = 3, 
is an astronomical number, and it may turn out that some exponential algorithm is 
much more efficient for this input dimension. For example, it is clear from Table 1.3 
that for r < 20 the algorithm with complexity 2” is much more efficient than the 
algorithm with complexity r°®. Therefore, in a finite time span the efficiency of 
application of some algorithm must be determined by the concrete conditions of 
the problem to be solved. Suppose we have algorithms A,,A2,A3,A4, and As, 
whose complexities are 1000r, 100logr, 10r?, r°, and 2", respectively. Then A; 
is more efficient than the others for input dimensions 2 < r < 9, the algorithm 
Az is such for input dimensions 10 < r < 58, Ag for 59 < r < 1024, and A; for 
r > 1024. Moreover, there are well-known exponential algorithms recommended for 
use in practice. For example, the simplex method for solving linear programming 
problems, as well as the branch and bound method, has exponential complexity. 

Here it is important to note that polynomial algorithms can be constructed only if 
we succeed in penetrating more deeply into the essence of the solution of a problem. 
Therefore, even the discovery of a polynomial algorithm of complexity r® or r!™°, 
which cannot be regarded as efficient in practice, is a step towards the solution of 
the problems posed. 

The difference, as regards algorithmic complexity, of solutions of combinatorial 
problems and their intensive study, which started in the 1970s, has led to a classifi- 
cation of all combinatorial problems in terms of complexity. 


5.4. Classification of combinatorial problems in terms of complexity 
of their solution. Having considered two types of algorithms for solving combi- 
natorial problems, we have naturally taken into account the fact that not for all 
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practically posed problems there are efficient solution algorithms. Whether such 
algorithms exist or not for certain problems is unknown, but the search for efficient 
solution algorithms, which started in the 1950s, has taken one of the important 
places in discrete mathematics. Failures made by experts in this field of investiga- 
tion have served as source for developing a theory which allows us to produce relative 
complexity bounds for newly arising combinatorial problems already at the stage of 
posing them. At the basis of this theory lies the investigation of combinatorial recog- 
nition problems which on the one hand turn out to be convenient for conducting 
such a kind of investigation, and on the other do not restrict the generality of the 
theory itself, since for each extremal combinatorial problem we can without much 
difficulty state a recognition problem strongly related to it. For a short exposition 
of the essence of this theory we start with basic notions and definitions. 

The class NP of combinatorial recognition problems consists of problems having 
an exponential upper bound for the complexity of solving them, while the verifi- 
cation of the correctness of a solution is ensured by a polynomial algorithm. For 
example, the solution of the problem in Example 1.13 on the existence of a clique 
in a graph G(V, E) of dimension « may consist of successive verification of all Ge 
subsets of V having cardinality «. It is obvious that the number of such subsets 
grows exponentially. However, for each set, the verification whether it is a solution 
(connectedness of the edges in F’) can be done by a polynomial algorithm. 

The class P of combinatorial recognition problems consists of the problems having 
a polynomial solution algorithm (which may yet be unknown). In other words, P 
consists of relatively simple combinatorial problems. 

After these definitions, the question naturally rises of the interrelationships of 
these classes. This question arises first and foremost because it is not difficult to 
prove a simple relation between these two sets of combinatorial problems: namely, 
P is a subset of the problems in NP, i.e. P C NP. However, the question whether 
the classes P and NP coincide or not has up till now not been answered, and it is 
one of the central problems in all of discrete mathematics: If P #4 NP, then what 
combinatorial problems constitute the set WP \P, and what are their complexities? 
In the investigation of the relative complexity of combinatorial problems of this 
set, and of the class VP indeed, an important role is played by a concept from 
mathematical logic. 

A polynomial reduction of a recognition problem A, to a recognition problem A2 
is a polynomial algorithm A, for solving A; which uses as certain elementary steps 
an algorithm Az which solves Az. Moreover, if Ap € P, then A; € P also. 

A special kind of polynomial reduction is of interest. 

We say that a recognition problem A, can be polynomially transformed to a 
recognition problem Ag if and only if an input x (of arbitrary dimension) of A; can 
be transformed in polynomial time (relative to the input dimension of x) into an 
input y of Az such that if x is an input of A, with answer ‘yes’, then y is an input 
of Aj with answer ‘yes’, while if x gives answer ‘no’, then y gives answer ‘no’. It is 
clear from this definition that polynomial transformation of problems is a particular 
case of polynomial reduction. 

The study of combinatorial problems in WP using polynomial reducibility has 
made it possible to distinguish a subset of problems in WP which have the property 
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that they have the same complexity. 
An recognition problem A € N’P is said to be an NP-complete recognition prob- 
lem if all other problems in WP can be polynomially transformed to A. 
The class of NP-complete problems can be characterized by the following impor- 
tant property: 
If there is a polynomial algorithm for solving at least one W’P-complete 
combinatorial problem, then there are polynomial algorithms for solv- 
ing all MP-complete problems. 


Up till now no NP-complete problem can be solved by any polynomial algorithm 
known. Taking this into account, many experts adhere to the conjecture that there 
cannot exists a polynomial solution algorithm for any NP-complete problem. A 
proof of this conjecture has not yet been given. 

To prove that a combinatorial problem belongs to the class of M’“P-complete prob- 
lems it suffices: 


e to prove that it belongs to VP; 
e to prove that some NP-complete problem can be reduced to this problem. 


The first. problem which was proved to belong to the class of MP-complete prob- 
lems is the problem of ‘realizability’. Its formulation and the proof of its belonging to 
the class of NP-complete problems is given in {21]. The combinatorial ‘realizability’ 
problem can be formulated for Boolean variables, and is one of the six fundamental 
NP-complete problems which are commonly used for proving the W’P-completeness 
of new combinatorial problems. In this list occur five other problems; these are given 
in Examples 1.9, 1.13, 1.16, 1.18, and 1.21. Methods of polynomial reduction and 
proofs of W’P-completeness, as well as the set of MP-complete combinatorial prob- 
lems in full extent are given in [50]. 

The practical value of our theory concerning the classification by complexity of 
combinatorial problems lies in the fact that when a new combinatorial problem 
emerges, it is possible to estimate its relative complexity. This, in turn, allows us to 
introduce a certain clarity in the choice of methods for solving the problem stated. 


CHAPTER 2 


Extremal problems on packability of number partitions 


This Chapter contains the basic mathematical results of the investigation on pack- 
ability of number partitions and presents the most complete list of results in this 
direction known at present. As an application of the applicability of these results we 
outline their relations with an archaic question on weighings and other problems. 


1. Number partitions 


In practice one often has to solve problems which involve natural numbers and 
sums of such. A convenient combinatorial interpretation of such problems is the 
notion of number partition. Number partitions, as an individual mathematical no- 
tion, first appeared in correspondence between J. Bernoulli and G. Leibniz. At the 
time of their appearance, partitions were the traditional object of enumeration prob- 
lems in combinatorics, and served as a powerful stimulus for the development of its 
methods, in the first instance of its enumeration methods. Quite recently one has 
succeeded in extending their domain of use to extremal problems. 

Apparently, one of the earliest proper extremal number-theoretical results is due to 


J. Sylvester, whose theorem asserts: Let 7),...,7; be mutually prime prime natural 
numbers, and let s(r),...,7r¢) be the largest integer s that cannot be represented in 
the form 


t 
s=) ari, where a; € No = {0,1,2,...},¢=1,...,t. 
i=l 


Then 
8(r1,72) = Tire —7T1 — 2. 
For t > 3 the problem of computing the exact value of s(r),...,7¢) is still open, 
and is called the Frobenius problem. 
To give a feeling for this problem we turn to a particular result: 


s(n,n+1,n + 2) =n{5| —1. 


It is proved as follows. We denote the righthand side of this equation by S(n), 
and prove that it cannot be represented as a function of n,n + 1,n+ 2. Assume the 
contrary: 


S(n) =an + b(n + 1) + c(n + 2) = (a+ b+ c)n + (6 + 2c). 
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Thenat+tb+c< [2 — land 6+ 2c > n— 1, s0 that c > ae contradicting the 


representation S(n) =n [3] —l< Fake ae 

We now verify that S(n) is the largest number that cannot be represented such, 
i.e. that every number larger than it is representable in terms of n,n +1,n+2. Each 
natural number larger than S(n) can be written as 


"(b +2] 4 @=1,2...5 G=1,.0.,0. 


We now consider four cases: 
n odd, i = 1,..., | +1; then 


n([5 te) i= (“= -i-1) (nt + G-Ym4D + =n, 


n odd, i = [2] + 2,...,n; then 


n(ff]+2)-#= tae 0+ (+42- fel) 


neven,i=1,...,3; then 


a ({3 re) —i= (F -4) (+2) + in 41) + @= In. 


n odd, 1 = 3,.-.,7; then 


“(5 re] —i=(n—-i)(n+1)+ (ite-F)n, 


This proves the required. This proof was obtained together with A. Klimov and 
I. Kan as a result-of the analysis of data of a special package of computer programs, 
created for the computation of exact formulas in the Frobenius problem. For not 
simply numerical values of s but precise formulas, see Problem 2.26. 

Thus, even a question of representing one number by a linear combination of 
several numbers turns out to be nontrivial. What happens to the presentation of 
sums by other sums? 


1.1. Basic concepts and definitions. A partition of a natural number n is a 
representation of it as an unordered sum of natural numbers: n = n; +--: + mn; 
these numbers n, are called the parts of the partition, and their number r is called 
the rank of the partition. 

A composition is a representation of a natural number n as an ordered sum of 
natural numbers. Thus, a composition may be regarded as an ‘ordered partition’. 

For example, for n = 6 the partitions are: 

rank one: 6 = 6; 

rank two: 6=5+1,6=4+4+2,6=3+43; 

rank three: 6=4+1+4+1,6=34+2+4+1,6=242+42; 
rank four: 6=3+1+1+1,6=2+2+1+4+1; 

rank five: 6=2+1+1+1+41; 

rank six:6=1+1+1+1+41+41. 
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The compositions of n = 6 are: 


rank one: 6 = 6; 

rank two: 6=5+1, 6=4+2, 6=3+43, 
6=1+5, 6=2+4+4; 

rank three: 6=4+1+1, 6=3+2+41, 6=242+42, 
6=14+4+4+1, 6=341+42, 
6=14+14+4, 6=24+3+41, 


=24+1+3, 
6=14+3+42, 
6= 1 +23: 


rank four: 6=3+1+1+4+1, 6=24+2+4+1+41, 
6=14+3+1+1, 6=24+14+2+4+1, 
6=1+14+34+1, 6=24+1+4+1+42, 
6=14+1+1+3, 6=14+242+41, 
6=14+241+4+2, 
6=14+14+2+42; 

rank five: 6=2+1+1+1+41, 

6=14+24+14+1+41, 

6=14+14+2+41+41, 

Shoe 42a 

6] 1414-14 1+ 2; 

rank six:6=1+1+1+1+1+1. 


It is easy to find the number of rank-r compositions of a natural number n: it 
is the number of ways of distributing r — 1 strokes in the n — 1 intervals between 
n points. It is a): If the rank is not fixed, then a stroke may be distributed in 
each of the intervals or not, and the total number of compositions of a number n is 
thus 2”~!. Consequently, compositions and partitions may be regarded as ordered 
and unordered multisets, respectively, with as elements natural numbers. In corre- 
spondence with this, a partition is often written in vector notation, (n,,...,n,) Fn, 
which denotes that n = n; +---+m,, or in shortened notation (n{',...,n@7) Fn, 
meaning that the part n; occurs in this partition a; times, i.e. n = ayny +--+: tapn,, 
and the rank of this partition is aj +--- + a,. Thus, every partition can be repre- 
sented as (171, 2”,...,n™") F n, where m; is a nonnegative integer, indicating how 
often the number 7 occurs in this partition of n, i.e.n = | imi, and 37, m; is the 
rank of this partition. We may depict a partition graphically by a point diagram, 
called Ferrer diagram, e.g. 


(122.344) ° 


This picture is convenient for representing various transformations of partitions. 
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For example, if the given diagram is rotated, we obtain a Ferrer diagram of the form 


which clearly corresponds to the partition (1,3,5,6), called the conjugate partition 
OF (1,22 37.4): 

Partition problems are more difficult than the corresponding composition prob- 
lems. For example, even the computation of p(n,r), the number of rank-r partitions 
of n, i.e. the calculation of the number of solutions of the equation n = x; +--+: +2,, 
XZ, >--- > 2,, in natural numbers zx; is one of the fundamental moments of enu- 
meration partition theory, and one may get acquainted with it through the classical 
handbooks of MacMahon and Andrews. Our main problem, however, is to get 
acquainted with the theory of partitions as an object in extremal combinatorial 
problems. 

Next to conjugation there are other kinds of correspondences between partitions. 
A sufficiently general form of extremal problem concerning partitions can be stated 
as follows: How may partitions can there be in a given correspondence? The selection 
of a concrete correspondence between partitions is determined by the conditions of 
the practical problem; more precisely, we should select a correspondence for which 
the partitions serve as combinatorial scheme for the problem. Therefore, below we 
introduce and study correspondences between partitions by means of which a wide 
circle of important applied problems can be solved. 

For a long time, extremal problems on number partitions did not form a separate 
direction, although the main underlying facts appeared much earlier. It is useful 
to turn our attention to one of them; more precisely, to a particular instance of a 
very general theorem which is now called in honor of its author, the English logician 
Ramsey. To this end we define a correspondence > between partitions by the rule: 
a partition (n1,...,n,) is in correspondence > with a partition (ki,...,k,) if there 
is ani (1 <2 <r) such that k; < m, ie. if the second partition has a part not 
exceeding the corresponding part of the first partition. We can then ask: For a 
given partition (k},...,k,), what is the smallest n = n(k,,...,k,) for which for any 
partition of this n into r parts the correspondence (n,...,m,) = (ky,.-.,k,) will 
hold? It is not difficult to verify that this smallest n(k,,...,k,-) exists, and can be 
computed by the formula 


nki,...,kr) = Doki —r +1. (231) 

i=] 
We can also compute the converse characteristic: For a given partition (m1,...,nr), 
compute the largest k = k(mi,...,n,r) each partition of which into r parts will have 


the property that (m1,...,m,r) = (ki,...,k,). This characteristic can be computed 
by 


k(nm,...,2r) = Son tr—1. 2.2) 
i=l 


We prove both formulas. The quantity n = n(k;,...,k,-) cannot be smaller than 
the righthand side of (2.1), since in that case there would be a partition for which 
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the required correspondence would not hold: 

Sa Pah Lea Ss hie 

i=l 
If now n would equal the righthand side of (2.1’), then in any partition (n),...,m,) + 
n we can find a part n; > k;, contradicting that nj; < kj; —1 (¢@=1,...,r), and thus 


we obtain the contradictory system of inequalities 


Sok r= (kh -1)>n=S ok —r tl. 
i=l i=] i=l 
The quantity k = k(n,...,n,) cannot be larger than the righthand side of (2.2’), 
since in this case we could find a partition for which the required correspondence 
would not hold: 


(m1,.--,M) = (m4 +1,...,n, +1) b Song +r. 
i=l 
If now k equals the righthand side of (2.2’), then n = k—r-+1, and hence, by (2.1’), 
the required correspondence with all rank-r partitions of k is satisfied by all rank-r 
partitions of n, and not only by (n),...,n,). This means that if k = k(n,...,n,r) 
is expressed as in (2.1’), then every rank-r partition of k is in correspondence with 
every rank-r partition of n. In particular, (2.1’) immediately implies the well-known 


Dirichlet principle: If n(k,r) is the smallest integer n such that in each 
rank-r partition of n there is apart not less than k, then 
n{k,r) =rk—k +1. (2.3’) 
In fact, it suffices to put kj = k (¢ = 1,...,r) in (2.1’). 
This principle is often stated in terms of allocations, and is then called the 
Principle of boxes: For any allocation of r + 1 objects over r boxes, there 
is a box containing at least two objects. 


In fact, it suffices to put k =: 2 in (2.3’), and to note that each allocation of n 
indistinguishable objects over r indistinguishable boxes can be adequately repre- 
sented by a partition of n into at most r parts. Thus, (2.1’) generalizes Dirichlet’s 
principle, but already cannot be treated in terms of allocations. However, there is a 
correspondence between partitions that can be treated well in terms of allocations. 


1.2. Four questions. Here we give four concrete problem statements, which in 
the course of the book will be called upon to aid in the perception of the material. 


QUESTION 2.1. How many weights does one minimally need to weigh any integer 
number of pounds from | to k? 

On equi-arm beam balances we provide for two kinds of precise weighings—single 
pan and double pan. In the first case the weights may be put into one pan only; in 
the second they may be put into both. 

In his book ‘Analysis Infinita’, more precisely, in the chapter on number partitions, 
L. Euler used the method of generating functions to substantiate the efficiency of 
two well-known fastest growing sequences of weights {(p +-1)*}i~0,12.... (p = 1,2) for 
p-pan weighing, respectively. Of course, other authors too have singled out precisely 
these sequences, since these are the most efficient systems of weights for weighing 
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any integer weight. In the finite case (load not exceeding k) it is natural to assume 
that the total weight of the weights is k; in this case geometric progressions are far 
from always being efficient. 


QUESTION 2.2. How many edges, m(n, H;,), must an n-vertex graph G,, mini- 
mally have, if among any k vertices there is a subgraph isomorphic to a prescribed 
k-vertex graph H;,? 

For example, imagine that one has a board with n terminals and that it is required 
to join these terminals by wires such that any k terminals are mutually ‘connected’. 
In other words, representing the terminals by vertices and the wires by edges, we 
arrive at the necessity of constructing a graph in which every k vertices are joined 
by at least one cycle. Here, of course, it is natural to minimize the total number of 
wires. 


QUESTION 2.3. What is the least amount of objects, of any kind, needed to re- 
alize all outcomes in the scheme of allocating n indistinguishable particles over r 
indistinguishable boxes? 

At first glance this seems paradoxical; the problem is best understood by con- 
sidering a concrete numerical example. Let n = 6 and r = 2. Then all possible 
outcomes of this allocation scheme are: (6,0), (5,1), (4,2), (3,3), where each num- 
ber indicates the amount of particles in each box. If we now consider three groups 
of 6 particles with 1, 2, and 3 particles in each of them, respectively, then a di- 
rect verification shows that the outcomes of the initial scheme may be realized not 
only by the 6 particles, but already by these three groups: e.g., (4,2) = (3 + 1, 2), 
(3,3) = (2+1,3), etc. Therefore the question concerning the least possible number 
of such groups makes sense. 


QUESTION 2.4. In the process of functioning, the memory of a computer becomes 
‘scattered’ into occupied and free sectors—-so-called fragments. If we now have to 
feed new information into the computer’s memory, e.g. programs or data arrays, 
requiring memory sizes k,,...,k;, then there arises the natural question: Can these 
sizes be distributed over the free memory fragments of sizes nj,..., Nr? 

Computer memory fragmentation is not just a simple concrete situation requiring 
solution for a given selection of sizes of fragments and inquiries, it is a phenomenon 
lying at the basis of a number of important processes in the actual functioning of the 
computer’s memory. Thus, it is also at the basis of developing theoretical approaches 
to the investigation of this phenomenon. The process of dynamic computer memory 
allocation is an example of most general, but not simple, fragmentation (see Problem 
2.21). 


1.3. Packability of partitions. The main correspondence between number par- 
titions which we will investigate is packability of them. 


A partition (ki,...,k:) can be packed into a partition (n,,..., nr) if there is a map 
@: {1,...,t} — {1,...,r} such that the following system of inequalities holds: 
So ky Sm, eat en 22 (2.4’) 
jEb-1 (i) 


where @'(i) = {j7: 7 © {1,...,t}, 6(7) = i} is the complete pre-image of the 
element i under ¢. 
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FIGURE 2.1 


In other words, the partition (kj, ..., k,) can be packed into the partition (n1,..., mr) 
if the parts kj of (k),...,k,) can be grouped together into r groups (each part k; 
goes into one group, and empty groups are omitted) in such a way that after adding 
all parts k; in each group we obtain r numbers p; < n; (i = 1,...,7r). If the partition 
(k,,...,k) can be packed into the partition (n),...,n,), then we write, using the 
set inclusion sign, (k,,...,k:) GC (mj,...,m,). For example, (2,2,2) C (4,2), since 
(4,2) = (2 + 2, 2); however, (2, 2,2) Z (3,3), since the three pairs cannot be grouped 
into two groups each of which does not exceed three. 
Packability is a binary relation on the set of all partitions of all natural numbers. 
It is not difficult to verify that this binary packability relation has the following 
properties: 
a) reflexivity: (m,...,Mr) © (m1,-.-., Mr); 
b) antisymmetry: if (k),...,ke) © (my,.-., Mr), (m1,---, Mr) © (ky,..., ke), then 
(Kis cox php (yc sag te) 
c) transitivity: if (ky,...,ke) CS (my,...,7m), (my,.-.,7) C (m,..-, ny), then 
Ci gst Ey 2 Sects) 
Consequently, packability is a partial order relation on the set of number parti- 
tions. 
We introduce the following notation: 


P: the set of all partitions of all natural numbers; 

P(n): the set of all partitions of the number n; 

P,: the set of all partitions of rank r; 

P,(n): the set of all partitions of rank r of the number n. 


Consequently, P.(n) = P(n)/N P,. 

We regard the sets P, P,, and P(n) as being ordered by packability; the set P.(n) 
can be conveniently regarded as being lexicographically ordered. 

The Hasse diagram of P(6) is depicted in Figure 2.1. From it we can clearly see 
that the partially ordered set P(n) has a largest and a smallest element: these are 
(6) and (1°), respectively. The levels in the Hasse diagram are the sets P.(n), which 
are depicted in lexicographical order. 

In essence, the main question concerning packability of number partitions consists 
of establishing the packability of a fixed partition (k),...,k:) into another fixed 
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partition (n),...,7,). In other words, is there a packing 
(ky,..., ke) G 2h eee ed 


Next to this question of testing the possibility of packing arises the question of 
realizing it: how fast can the packing of one partition into another be realized? It 
is obvious that these questions are algorithmically equivalent, since the presence of 
a fast testing algorithm ensures the presence of a corresponding packing algorithm, 
and conversely. 

We estimate the complexity of a complete listing in order to determine packability 
of two concrete partitions. By the definition of packability, this listing reduces to 
the listing of all maps ¢: {1,...,t} — {1,...,r} and the verification of the system 
of r inequalities (2.4’) for each of these maps. Since the complete pre-image of each 
such map ¢ is an ordered list of t not necessarily distinct elements, taking any of the 
r values, by the listing scheme the total number of such maps is r‘. Consequently, 
a complete listing in order to determine packability consists of the verification of r* 
systems of inequalities (2.4’), or of the verification of the rt! inequalities making 
up the system (2.4’). 


FIGURE 2.2 
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2. Hasse diagrams and Mobius packability functions 


It turns out to be useful to draw the Hasse diagram of the whole set P and some 
of its subsets. (See Figure 2.2.) It is clear from this drawing how to construct the 
Hasse diagram of P: first one draws the two independent diagrams of the sets P(n) 
and P(n + 1), with the first one level lower than the second, and then one draws 
the edges {(n,..., Mr), (™1,---, Mr, 1)} between them, and only these edges. In fact, 
if qg € P(n), p,(q,1) € P(n +1), and p ¢ Ut(q,1) = {x € P: (q,1) € x} and 
q © p, then for any concrete packability of qg into p only one unit volume remains 
unfilled, which means that (q,1) € p, contradicting the assumption. Thus, the 
Hasse diagram of P can be represented by the sequence of Hasse diagrams of the 
sets P(n), n = 1,2,..., and edges between them as indicated. These edges can be 
naturally interpreted as a ‘shift’ of P(n) inside P(n+ 1) together with a raise of one 
level. It turns out that for an arbitrary interval this procedure gives information on 
the Mobius function also. 


LEMMA 2.1 (SHIFT LEMMA). Suppose the interval (x', y'| is the result of ‘shifting’ 
an interval |x,y|. Then 


u(z,y') = —p(2,y). 


PROOF. We proceed by induction with respect to n = |[z,y]|. If n = 1, then 
|x, y'| is a chain of length 1. Consider an arbitrary n > 1. Then 


weyy=- YS wla,z)=-|  w@2+ DY wlz,2)| = 


r<z<y’ x<z<y z<y 
z€|z’,y'| 
> wz,2z)=ind=—- SS -p(z,z)= YO w(z,z) = —p(z,y). Oo 
z<y! z<y’ riz<y 
z€[x’,y'| z€(z,y| 


In particular, this simple, but very general, result implies that all information on 
the Mobius function of P is concentrated on the subsets P(n) only. In addition we 
need only use the following lemma. 

In fact, this lemma is an instance of the theorem on the Mobius function of the 
discrete product of two partially ordered sets, given in the book of R. Stanley. 


LEMMA 2.2 (LEMMA ON THE ZERO ZONE). If (P,<) is a locally finite partially 
ordered set, and if D C (P,<) is a subset such that for a fired element p € (P,<), 
Vq € Dag € |p, 4]: [p.al \ p,m] CD, 

then 
VqeD: plp,q) = 0. 


PROOF. Assume the contrary and consider the smallest q € D satisfying the 
requirement but for which p(p,q) ~ 0. Then 


OA pp,g)=- dS. wp.z2=-| dX w,2)+ DY wlp,2) 
pSz<q z€(p,qi| z<q 
z€lp,q|\{p.q1] 
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FIGURE 2.3. n=6 


Here q is precisely an element for which the condition mentioned in the requirements 
is fulfilled. Therefore the last sum must vanish: either the summation range is empty 
or all terms are equal to zero, since the range of summation lies entirely inside D 
and consists of elements of levels less than qg, by the choice of g. Since the one but 
last sum is identically zero by the definition of the Mo6bius function, the total sum 
must equal zero, contradicting the assumption. O 


It is easily seen from the diagram how the levels of the diagram of P are made 

up: 
WAP) = P-alt-D, t= 1,2,.... 
1>0 

Altogether, the high degree of visuality of this diagram of P is clear from the 
following example. Consider P(< n), the set of all partitions of all numbers not 
larger than n, and consider in it all partitions without unit parts, as well as the 
partition (1). It is clear from the diagram that the thus distinguished subset is 
isomorphic to the set P(n). This follows in particular from the well-known identity 


|P(n)| - 1 = SO IP(k; = 2)1, 
k=2 


where P(k; > 2) is the set of all partitions of the number k without unit parts. 
For n = 6 see Figure 2.3. 
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3. Simplest properties of packability of number partitions 


Next to packability one has also studied other binary relations on partitions. Some 
particular cases of packability have been considered earlier. For example, on P(n) 
ordering by packability has also been called domination by either decomposition or 
pasting. In these particular instances one studied, as a rule, either enumeration 
or structural problems related to partitions (see, e.g., [127]). But the generality of 
P(n) is not sufficient for a convenient statement of extremal problems. In the given 
generality the notion of packability was introduced in [5] in connection with the 
modeling of a number of engineering phenomena. 

The question of determining packability for two concrete partitions is an algo- 
rithmically hard problem, since this problem reduces to the solution of a system of 
diophantine equations [5]; also, it is equivalent to one of the problems on the list 
of algorithmically hard problems (the so-called bin-packing problem |21]). However, 
sometimes one succeeds in reducing the number of input parameters. 


3.1. The exchange lemma. Each number partition may be interpreted as a 
multiset; more precisely, to a partition (1%,2°,3°,...) corresponds the multiset 
{12, 2° 3°,...}, and conversely. This makes it possible to use the operations of 
union, intersection, and difference on partitions. For example, if p and q are two 
partitions of a single number, then pMq denotes the partition obtained as the result 
of intersecting the multiset of all parts of p with that of g. The use of these opera- 
tions in the study of packability of partitions makes it possible to detect a number 
of most. simple properties of packability. 


LEMMA 2.3. A partition q can be packed into a partition p if and only if the 
partition q — (pq) can be packed into the partition p— (pq). 


In other words, the question of packability of partitions is equivalent to the 
question of the packability of the partitions obtained from the original ones by 
deleting in each of them an identical amount of identical parts. For example, if 
(1, 2,2,3,5) Cc (2,3,3, 7), then (1, 2,5) C (3,7), and conversely. 


PROOF. Sufficiency is obvious, so we prove necessity. Suppose the partition g = 
(q1,---,;Q) can be packed into the partition p = (p,...,p4), ie. the system of 
inequalities 


Dp dea O14 (2.1) 


holds, where 1 <i <t and g = (@11, 41,2, ++ 5 41; 92,15- ++) Ul): 

If in (2.1) we have equality for at least one 7, then we can immediately pass to 
partitions with fewer parts. If now for some q; there is a p; such that p; = Gm, 
then the 7th inequality can be taken out from (2.1), and the ith inequality can be 
written as 


Di 2 Qa he Ge Pt Pa Sa aa Fa ee aS 
ep ee Gime Ee Opa PP ae Gan ee Ga 


Thus, we pass to a partition with lower rank. Since this process is finite, we obtain 
the required. 0 
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It is obvious that the practical value of the exchange lemma is restricted by the 
complexity of distinguishing an identical amount of identical parts in two fixed 
partitions. 

However, one way or the other, the practical need to solve the question of pack- 
ability of partitions has not completely disappeared. Moreover, a number of actual 
situations do not leave the time to perform a complete listing. It is thus necessary to 
search for faster methods of verifying a guaranteed packability. For example, if we are 
to verify packability of a partition (k,,...,k,) | k into a partition (n,,...,n,) Fn 
and it is known that k > n, then an answer is obvious even without complete list- 
ing: the first partition can not be packed into the second. It is precisely this trivial 
reason that lies at the basis of the here presented extremal approach to constructing 
fast methods for verifying a guaranteed packing. In this sense the verification of the 
inequality k > n is no other than the verification of an extremal (in fact, largest 
possible) number k, since n is the largest possible value of k for which there can be 
a partition of k that can be packed into a partition of n. Thus, the extremal ap- 
proach to establishing packability consists in detecting and computing the extremal 
characteristics of partitions (which are nontrivial, but have polynomial complexity) 
whose comparison gives a guaranteed condition for packability or nonpackability. 

The first extremal result concerning packability of partitions is: 


3.2. The rank condition for packability. In a number of cases we succeed in 
solving immediately, without any algorithmic verification, the combinatorial prob- 
lem of recognizing packability of partitions. For example, the following theorem 
saves us from the necessity of clarifying the question of packability of partitions of 
certain fixed ranks. 


THEOREM 2.1. Let t(n,k,r) be the smallest number t for which 
Vp © Pn) Vq € B(k): aC p. (22) 
Then 


u(n,k,r) = max (k—|"[ 41,1), (2.3) 


PROOF. If max(k — |n/r{+ 1,1) = 1, then k < |n/r[ and by Dirichlet’s principle 
there is in any partition of n into r parts a part not less than k, i.e. the required 
packability holds. 

Suppose now that 


om (t= J2[ 00) “k-J8[ 


Then the truth of the packing 
CS ceipe (|=|.---.J=[) E P.(n) 
i r 


implies the truth of the inequality t > k—|n/r[+1. We prove this case by induction 
with respect to n. 

Assume that (2.3) holds for all values up to n—1 inclusive. We have to prove it for 
n. Let t= k—|n/r[ +1, p = (m, pra(n—m)) € P,(n), q = (d,pe-1(k —d)) © Pa(k). 
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Clearly, m > |n/rl|=k—-t+1>d. Ifm=d=k—-t+1, theng = (k-t+1,1%), 
and thus Vp € P,(n), p C q, so that g = (1'). So we assume m > d > 1. 

The packing g C p follows from the packing pi_1(k — d) C (m— d,p,p_1(n — m)), 
which, in turn, follows from the inequality t—1 > t(n—d,k—d,r). By the induction 
hypothesis, this inequality takes the form k — |n/r| = t—1> t(n—d,k—d,r) = 
k—d-—|(n—d)/r[ +1, or |n/r[+1<d+](n—4)/r|, which holds ford >2. O 


It is obvious how to use this theorem in establishing packability for two parti- 
tions. If we are to verify the packability of a partition (k,,...,k,) into a partition 
(n,,..-,M,), then fulfillment of the inequality t > t(n,k,r) implies (k,,...,k%) C 
(n1, wee tbe): 


COROLLARY 2.1. If n(k,t,r) is the smallest n for which 
Vp € P.(n) Va € i(k): a Cp, 


then 
n(k,t,r) = max(k,r(k — t) + 1). 


The next step in answering the question of packability of two concrete partitions 
lies in dropping the freedom of choosing the partition to be packed. In other words, 
in fixing the ‘smaller’ partition by choosing from the partitions into which packing is 
to be realized. Thus, the answer to the corresponding extremal problem will in this 
case depend not on three parameters, as in Corollary 2.1, but on t+1 parameters (all 
parts of the partition (k,,...,k,) + k to be packed and the rank r of the partitions 
in which this extremal bound must guarantee the packing (k,,...,k:) Fk). 


3.3. Sal’nikov constants. An essential widening of the class of extremal prob- 
lems regarding number partitions is obtained by the results and problem statements 
of S.G. Sal’nikov. Below we give an exposition of his results and statements. 

Let P(k,p) = {(1™,...,p%) Fb k} be the set of all partitions with parts at most 
p. Clearly, P(k,1) = {(1,...,1)}, P(k,k) = P(k), and, in general, P(k,p +1) D 
P(k, p). 

The structure of P(k,p) is described in the following assertion: 

if k’ and k” are two partitions of a number k, the partition k’ is 
lexicographically smaller than k”, and k” belongs to P(k,p), then k’ 
also belongs to P(k, p). 


Let k, p, r be natural numbers. The constant n(k, p,r) is defined as the smallest 
n such that Vk € P(k,p) Vn € Pp(n): k CR. 
Prove that, for natural numbers k, p, r (1 < p< k), 


= k aii oN ng ieee 
n(k,p,r) = x(n |=] +0 ‘gyi p.b+ ([ aes | i) »), 


Let r(k, p,n) = max{r: Vk € P(k,p) Wi € P,(n): k C7}. The following equation 
readily follows from the definitions of n(k, p,r) and r(k, p, n): 


r(k,p,n) =max{r: n(k,p,r) <n}. 


Using the formula for n(k, p,r) and this equality, calculate an explicit expression for 
r(k,p,n). 
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For an arbitrary (fixed) partition 
(Ai,-.-,k&) Fk, 


the function p = p(k,,..., kt) is defined as the largest integer p having the property 
that for any partition k’ = (1™,...,p%) / k the packing k’ C (k,,...,k;) holds. 
Prove that the ‘order’ equality 
P(k) = In(k) 
holds, i.e. there are real constants c’ > c’ > 0 (independent of k) such that 
cIn(k) < P(k) < c’In(k). 


4. The principle of complete packability 


THEOREM 2.2 (PRINCIPLE OF COMPLETE PACKABILITY). Let ky > kp >--- > 
k, and r be natural numbers, and let n(k,,...,kj;r) be the smallest n for neh the 
partition (k,,...,k:) + k can be packed into every partition of n with at most r parts. 
Then 


Thisstaghar) = max ni, (3b +k + ( (r — nor). (2.4) 


PROOF. We denote the righthand side of (2.4) by f(k,...,kir). Clearly, 
n(ki,..., kyr) > f(ki,..-, ker), since the packing (k,,...,k:) C (n—- (r — 1)(ki — 
1), (k; — 1)"~') implies the inequality n — (r — 1)(ki — 1) > ky +--+ + ki. Moreover, 
ift > 1, then 


flys. Ber) > hi +f (hays cher), (2.5) 
because if 7 is an index maximizing f(ke,...,k:jr), then 


f (Rigerss REY) SOS ee ee eee eS een ene 


j=l j=2 
= ky Se ro a el 

We prove (2.4) by induction with respect to t. For t = 1 it is precisely Dirichlet’s 

principle. For the induction step from t — 1 to ¢ it suffices to prove that if n = 

f(ki,.-.,k37), then the required packability takes place. Consider an arbitrary 


partition (n,,...,n,) / n. We always have n, > ky, since f(ki,...,ki3r) > ky + 
(r — 1)(k, — 1). Therefore the packing 


(ko,.--, kyr) C (ny — ky, na,..., nr) (2.6) 


implies the packing (ki,...,k:) C (mi,...,n,). In turn, (2.6) follows from (2.5) and 
the induction hypothesis 


—kyt+noat::-+n,=n—-—k, = f(ky,..., ker) —ky > f(ko,..-, kr). 


If, moreover, n; = k,, then we can use the obvious monotonicity of f with respect 
tor. O 
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In the statement of the principle of complete packability the condition ‘at most’ 
can always be omitted, except in the degenerate case k = t < r. The principle can 
also be formulated in a dual manner, as a formula for the largest r for n fixed. 


COROLLARY 2.2. [fk, > 1,k, > ko >--- > ky are natural numbers, (ki,...,k) 
k <n, and r(ky,...,ki3n) denotes the largest r such that the partition (ky,..., kt) 
can be packed into every partition of n into r parts, then 


n— d_1 ky 
r(Rigs askin) = sin, Ae +1. 

In fact, by the principle of complete packability the required r is the largest integer 
solution of the inequality n > n(ki,..., ken). 

Facts concerning packability of partitions may be presented in terms of allocations. 
For example, if k = n(k,,...,k:;r), then each allocation of k particles over r boxes 
can be realized by allocating ¢ groups of particles with k; particles in group j (7 = 
1,...,¢), under the condition that each group is completely put into a single box. In 
particular, for each allocation of n particles over r boxes there are ¢ distinct groups 
of particles (with [(n + r — 1)/(t + r — 1)] particles in each group) that completely 
belong to the boxes. For example, Figure 2.1 makes clear that every allocation of 
6 particles over two boxes can be realized by allocating only three compositions of 
3, 2, and 1 particles, respectively. The question of the smallest number of such 
compositions which would realize all allocations of all particles is natural. Below we 
will consider this question in more detail. 

It is obvious how to use the principle of complete packability to establish packa- 
bility of two partitions: if we are to verify packability of a partition (k,,...,k,) into 
a partition (n;,...,n,), then fulfillment of the inequality n > n(k,,..., kj; 7) implies 
the packing 

(ka. ky) © (m1,---j My): 

For example, n(3, 2, 1;2) = max(5,5,6) = 6, and thus the partition (3,2, 1) can 
be packed into every partition of 6 with at most two parts. This remains true if we 
consider an arbitrary natural number not less than 6. Moreover, in this example 
Theorem 2.2 does not say anything about the packability of the partition (4, 1,1) 
into (5,1) or (4,2), nor on the packability of (4, 1,1) into (3,3). 


5. Packability with restrictions 


Sometimes it is required to guarantee a packability (ki,...,k,) F & not in all 
partitions of a number n with r parts, but only in some of these. An extremal result 
guaranteeing packability of a fixed partition in not all partitions of a given rank is 
as follows. 


THEOREM 2.3. Let n2,...,Nr, kj > +--+: > ki, and r be natural numbers, and let 
n(ky,...,ke3ne,...,nr) be the smallest n for which every partition (pi,...,pr) of n 
with r parts and such that pj < ni (4 = 2,...,7) has the property that (ki,..., kz) © 
Dijes, De). lhen 


WU Rigas Ki, 22.4) = max (35 5S hin Gah: — n), (220) 


1<i<t 1=2 
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PROOF. a) If f(ki,...,k:) denotes the righthand side of (2.7), then 
f(kiys+-y ke) — ky > f (kayo. Reg r): (2.8) 


In fact, if 2 maximizes f(ke,...,k:), then 
j Rises ; ki) 2 yh as $2 min(m, k; _ 1) = 
j=l 1=2 


Sa kG + So kj + $5 min(m, ki — Lyk + f(ko,..., ke). 


j=2 1=2 


b) Let (p),...,pr) be a partition of f(ky,...,k,) with r parts in which p; < n; 
({ = 2,...,r). Then it contains a part not less than k;. In fact, in the opposite case 
we would have 


f(ky,..-,k) > ky + 5° min(m, ky —l)> 


1=2 
> > min(pi, ki —-l)= Sp Saf Kineeo skp): 
l=1 l=1 


c) We now prove the theorem by induction with respect to t. For t = 1 equation 
(2.7) gives 


RR? 15540 os) SR = min(n, k — 1), 


1=2 
and by b), any partition (p),...,p,) of n(k;n2,...,n,) with r parts and such that 
ps <n; (i = 2,...,r) contains a part not less than k. 


d) We now assume that the required holds up to t — 1 inclusive; we show it to 
hold for t too. Let (p1,...,p,) be an arbitrary partition of f(k),...,k,) with r parts 
and such that p; < n; (2 = 2,...,r). Then by b), there is a part p; > k,. Thus, if 

(Rosi Re) Spin esa Pp HHij as Bde); (2.9) 
then we have the packing (k),...,k:) C (m,..., pr). In turn, (2.9) follows from (2.8) 
and the induction hypothesis 

Pi tees + py — hy tes t pp = f(Ri,.--, he) — hi 2 

> f(ko,..., ky Sa IL May sas Ke Masha a5 Wyn): DO 
As distinct from the principle of complete packability, this theorem guarantees 


not only establishment of packability of partitions, but for certain partitions it can 
also be used as a nonpackability criterion. This is demonstrated in 


COROLLARY 2.3. If ay Ti = TUCK hes oi WE Nala Me) then Ripa wa Ke) S 
(n1,...,Mr). 


If 
rT 
WR en hi age, Ne) >So ni 2 Max n(kr,..-, he; ma,--. 77 — Londo) 
i=l pe 


then (ki,...,k) Z (m,.-., np). 
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Proor. Let M be the set of all partitions (p1,...,p,) of the number n = SY, 1: 
for which pj < nj (i = 2,...,7r), and let M; be the subset of all partitions in 
M for which pj; < nj —1 (j = 2,...,7r). Clearly, M = Uf_.M; U (m,...,n,). 
Since n(k,...,k¢3n2,...,Mr) > n, there is in M a partition (q,...,q-) such that 
(ky,...,k:) Z (q1,---,@-). However, since 


n> max hiss pkhine ates ng = lye y), 
2<j<r 


for every (pi,---, Pr) © Uf_2My; we have the packing (k),...,k;) © (p1,---, pr). Thus 
we must have (q),.-.,@r) = (m1,-.-,mr). O 


In particular, Theorem 2.3 thus allows us to assert that the partition (4, 1,1) can 
be packed into the partitions (5,1) and (4,2), while Corollary 2.3 establishes that 
it cannot be packed into the partition (3, 3). 

On the other hand, the relation between these theorems is very essential. More 
precisely, the function n(ki,...,kjj n2,...,,r) is not only similar to the function 
n(k,,...,k:;r), but can sometimes be expressed in terms of it: 


l 
WU Rigas: ky (m"")) = S¢ k; + n(m+ Lkiee her) —-m— l, 


j=l 


where ky4; <m < ky. In fact, 


n(k1,..-, ki (m"—")) max (= k; + (r — 1) min(m, k; — »| = 


StS j=l 


{| 


l l 
max (34 + (r—- 1)m, So ky + nab) = 


j=l j=l 


fl 
M~ 


oti ales ey = 
l 
= Sok; +n(m + Lokpiasa sched) —m-—l. 


6. Extrema of complete packability 


We call the quantity n(k,,...,k:;r) the bound of complete packabiltty and the 
righthand side of (2.1) the function of complete packability. We denote the latter 
by f(z1,...,2437), or by f(X) if the value of r is clear from the context. Corollary 
2.1 is equivalent to the equality 

beg ET) — he 
tee gs kis) max(k, r(k t) ae ) 
This value is realized for (ki,...,k;) = (k —t +1,1°?). 

One of the main questions investigated here consists of estimating the smallest 

possible value of a bound of complete packability, i.e. to compute the quantity 


m(k,t,r) = i min n(k,..., kr). 
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Before passing to this estimation, we note some simplest properties of the function 
and bound of complete packability. 

The monotonicity of f(X;r) with respect to r is characterized in the following 
way. Let r; > re, and let v and w be indices maximizing f(X;r,) and f(X;7r9), 
respectively. Then 


1 ST) (ay 1D) Si A) eH f(A) S| i = 72) Go = 1): 


In fact, 


=e Ee + (= (te 1) = y= (= lw) 2 
SOC) FCS 


2dr, FOr De 1) ~ Yeas (r2 — 1)(@w — 1) > (71 — 72) (@w — 1). 


In particular, this monotonicity characterizes the subset of indices from [é] = 
{1,...,¢} that maximize f(X;r). 

Let I(r) = {i € [t): f(Xsr) = Sh. a; + (r — 1)(ai — 1)} © [t]. Then |Z(r) 9 
I(r + 1)] < 1. Moreover, if a(r) = miner) i, b(r) = maxienryt, then b(r) < 
a(r—1), which, in turn, gives a recurrent way of computing the function of complete 
packability, as follows: 


Oe iene 1) (Seas a ae a-0] : 
where a(1) = 1. 
If X and Y are real vectors of identical dimension, then 
F(X + Yr) < f(Xirn)t+fYir)+r—-1. (2.10) 


In fact, suppose the indices u, v, and w maximize the functions f(X + Y;r), f(X;r), 
and f(Y;r), respectively. Then 


f(Xsn4+hfYor)tr-1l= eS ease > 


j=1 j=l 


> Doles + ys) + (r= Gu + yu VY) = F(X + 57). 


For / a natural number, the following equations hold: 


f(lky,...,lkgr) =Uf(kn,...,kgr) + (r -1)(L—D), (2.11) 
fies Berl 14D) =U (k,n), (2.12) 
PR sckary = Uf (Riper holo (2.13) 


The same equations hold for the bound of complete packability, for the same pa- 
rameters for which the quantity n(k,,...,k:;r) is well defined. 
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What is the meaning of these formulas when 7 is not a natural number? One 
experiences a strange sensation when looking at the righthand side of (2.13): we 
are already able to visualize multisets in which the elements have nonnatural mul- 
tiplicities, but how could we comprehend a partition with a nonnatural number of 
parts? However, these formulas work quite efficiently (which may be due to its level 
of formalization) and make it possible to compute certain conditional extrema. For 
example, if we are to compute the minimum of the function of complete packability, 

a(k,t,ry= , min f(ki,---,kesr), 
not over all vectors (k;,...,k,) / & but only over those in which each component is 
present exactly / times, then, denoting this minimum by py, (lk, lt, r), i.e. putting 


pu(lk, lt,r) = ei min . f(ki,...,kir), 


using the inequality (2.13) we find 
pu(lk, ltr) = lu (k,t,1 + 5). 


For X € N* the function n(X;r) can be defined for r € R by n(X;r) = f(X;r). 
This definition is correct, since f(X;r) is continuous and monotone nondecreasing 
with respect to r. 

The exact values of certain extrema of the function of complete packability are 
given by 


LEMMA 2.4. Let X = (xj,...,2%;) € R*. Then 
1) if Mia Ti =p>0 and f(X;r) = Ee Tes og ah — 1)(r —1), then x; > 0; 


2) typ ee SOS et eS it Sh LZ, and S, is the symmetric 
group of all permutations of the set it] = {lease hy then 
max f@xjoe5 haat) = Da (x; — 1)(r —- 1); (2.14) 


if, moreover, Y~;-,7i > 0, then 


t 
min PGHie ort) = Dee ieee); (2.15) 
; j=l4l 
3) af Sate 0, ey (el Sd) then 
max f(X;r)=1-—r+§, (2.16) 
d(r — 1)" 
2(ré-} — (r — 1)f!) 
4) if Si..2%1 = pe R and1<reR, then 


min f(X;r) = —r+l; (2:17) 


pr‘ 


min f(X;r) = 
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Proor. 1) Assume the contrary: x; < 0, and consider the largest m € [i — 1] for 
which z,, > 0. Then 


m t 

> aj + (tm — 1)(r — 1) > Say + (i — 1)(r - 1), (2.19) 

j=l j=l 
since Xm41,---,2; are negative. If such an m does not exist, then we consider the 
largest m € [i + 1,¢] = [é] \ [2] for which z,, > 0. But then (2.19) is again true, 
since p > 0 and 2,...,2i,€m,---, XZ; are all negative. In fact, in this case (2.19) is 


equivalent to the inequality 


m 
S> 23 + (fm — I(r -— 1) > (ai — 1)(r - 1), 
j=itl 
which follows from the fact that rz, > 0 > x; and \{5";,, 2; 2 0, since p > 0 and 
the last sum is over all positive x;. 

2) Snes tee =— DH 1) Sof eee Vi 4-y Ri) SS maxses, J (Xs) = 
max, max; ae) + max, maxi(%si) — 1)(r — 1) = ya} + (x; — 1)\(r — 1). 
This proves (2.14). 

To prove (2.15) we note the following two facts. 

a) We first show that there is a permutation of the form (2141,.-.,2t,%s(1), +++, £s(2)) 
minimizing f. For this it suffices to verify that if for an arbitrary permutation 
Y = Gigest Ho, Yo = 0, then for Y=. (yg, Wij 0s 6 outs Dodie ess Ya) we have 
£¥) < f(Y). 


In fact, if 
Pp 
f(Y") = q+ do ys + (yp -— I(r - 0), 
mr 
re 
then 


P P 
FY) = doy + Ye — Dr - YD 2 vg t+ Diy + (y — Y(r- DY = FY), 
j=1 j=] 
: iq 
and since )5_, y; > 0, by 1) above we have p # q. 
b) We now prove that if x} >--- > x, then 


min f(€s(1)1---,€s@37) = P2522, 257): 
For this we verify that if 2; < xi4,, then 
Piel) S Pisa Peace eT). 
In fact, we put 
XS itera). ROD te Ue a) 


Then: if f(X") = SJ.1 2; + (Lp — 1)(r — 1), where p i, + 1, then 


f(X) 2 D095 + (a — Wr — 1) = F(X") 
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it f(x) = ss 67a (ea Ye = T)-then 


i+] i+1 
[(X) = Der, + (aa = Dr 1) 2 EZ teat gp ead) ef X4), 
if f(x’) = pay Xj oe (x; = 1)(r == 1); then 
i+] i+] 
F(X) Boas + (Benn — lr — 1) > Say + Hee lr —) = F(X. 
gel =i 


Since every permutation can be represented as a product of transpositions, b), and 
thus (2.15), has been proved. In particular, (2.15) implies that if X is an integral 
vector with positive components, then 


min f(Xs;r) > D0 xj. (2.20) 
‘ E; 


3) 1—r+dr/2 = f(d/2,0,...,0,-d/2;r) < max f(X;r) < max; S412; + 
max;(x; — 1)(r — 1) < d/2 + (d/2 —1)(r —1) = 1—r+dr/2. Thus (2.16) has 
been proved. 

Equality in (2.17) is realized for 


z= d dr‘? dr‘-3(r — 1) d(r — 1)'-? 


We show that (2.17) holds as a lower bound. Let f(X) = ae £3 +(x;—1)(r—-1). 
By 1) above, x; > 0, and then 


d d dr? 
> —— 1 Stata 7 =e —_ a= 
f(X) —_ 2 an ozs F(X, , 21,7) 2 T 2(r! — (r— 1)!) CF 1 
d(r — 1)! d(r — 1)! 
a ee ae ol See 
Fast) = eeag ey, 


Here, the first inequality holds because of (2.15) and the equality following it because 


of (2.18), which proves (2.17). 
4) Let q; = (sent) i @= 1,2..5t)5 O= (H,2-+,@). Then ah = pand 


rt—(r—1)! 


t 
pr 
rt—(r—1) —T + i (2.21) 


and for each i € [t] the equation ag + (qg — 1)(r — 1) = f(Q;r) holds. For 
X = (x1,...,24), S§-, Li = p, the latter implies 


f(Q;r) = 


f(X;r) = a= + f(X —Q;r). (2.22) 
Moreover, 
gua) = fOr) sr T, (2.23) 
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In fact, assuming the contrary, i.e. min f < —r + 1, we are led to the system 
i 
So 2; = 0, 
j=l 


S aj t+rz <0, i=1,...,8, 

j=l 
which is contradictory for r > 1. Indeed, subtracting the equality from the ¢ in- 
equalities we obtain (r —1)x; < x41 +++: +24 (2 = 1,...,¢), whence we successively 
find that x; < 0, x1 < 0,...,2,; < 0, contradicting the equality in this system. 
To prove (2.23) we cannot refer to (2.17) with d = 0, since in the proof of (2.17) 
we have used (2.18). Using now (2.21), (2.22) for k = p and (2.23), we obtain the 
required. O 


COROLLARY 2.4. If k =0 (mod r! — (r — 1)‘), then 


a ee +1: (2.24) 
m OD ae Soe aaa r+1; 
this value is attained for the partition 
kr'-*(r — 1)*"} 
ee |e ee 
(ee ; =i 


We now consider the estimation of m(k,t,r) when k # 0 (mod r* — (r — 1)*). 
Everywhere below, Q denotes the t-dimensional vector with components 


krt-? oS od 
a= (So), patie ak 


ré'—(r— 1)! 


and | = 37}_,{q:}. We note the following property of Q: all components gq; are either 
all nonintegers or all integers; the latter case holds if and only if k is a multiple of 
r‘ —(r—1)*. This follows directly from the fact that the numbers r‘~*(r — 1)*~! and 
r‘ — (r — 1)! are coprime for every 7 € [t]. Consequently, for every i € [t] we have 


1 1 
——_————. < ig;} < ] — ————_—_.. . 
foe rt —(r—1)é (22?) 
Moreover, it is obvious that 1 <1] <t-1. 
We now prove the following lower bound: 
pS a pS OS (2.26) 
9 —_— rt = (r a 1)é ’ ’ = 
where {Q} = ({a},---, {%})- 
If K = (k,...,k) tk, then by (2.22) we have 
ki 
[(Kjr) = ASGaie oe 


In turn, 


HERO n= HK =) ORT) 2s (er) sO ny ra 
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Here, [{Q] = ({qi],---,[gs]) and [g] is the integer part of the number [gq]. The latter 
inequality follows from (2.10) by putting X = Kk — [Q| — {Q}, Y = {Q}. Moreover, 
by the remark accompanying (2.20) we have f(K — [Q];r) > Si_i (ki — [a]) = 
Thus, 

PE =Qlri=JtOh =r hletare laf }r). 
Note that the righthand side of (2.26) is always an integer. The bound given implies 
that if {qu } = maxi<i<i{G}, then 


m(k,t,r) > aos — {au }(r — 1) 
since 
F(EQ}:N) < may Soap} + max {ax} — Dlr — 1) <4 (Cane} — Dlr = D. 


j=l 

Another lower bound is of more geometrical shape; more precisely, if d = min || Z— 
Q\lz,, where the min is taken over all integer vectors Z = (z1,..-,2) such that 
yy, % = k and & is the space of infinite number sequences with finite sum of 
absolute values, then 


kr* i d(r — 1)*! 
ré—(r—l)t  2(r#-) — (r -—1)&}) 
This bound follows immediately from (2.17) and (2.22). In particular, it suggests 
that the minimizing partition cannot be an integral vector that lies too far away 


from Q in the metric of ¢,;. Moreover, it may be used for estimating the bound of 
complete packability of any concrete partition K = (k1,..., kz); 


m(k,t,r) > —r+l. 


kr |K — Qlla(r — )! 
Se ee el. 
WN ee OGG iy 
As upper bound we prove the inequality 
oes eer k(r — 1) > m(k,t,r) 
rt —! (r = 1) Qml — aia , 
where {mz} = ming_141<i<e{Q}. Consider the partition 
K= (lqi], be [4e—u], }Qe—-r411 , oid , lae[) F k 
By 1) above, for it we have 
Di ae os < 
f(K —Q) = max (S6- — 43) + (ki —  — Cr n| < 
< Ec dali — 9%) + max (ki —G —Y(r— I= — | min _tah(r — 1). 


Taking into account (2.22) this implies the required. 
Thus, (2.25), (2.26) and the last estimate imply the two-sided bound 


kré —r4+1 kré+r—1 
ee oie eeeei pee l. 2.27 
er eG m(k,t,r) > ray r+ ( ) 
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We consider some refinements of the general bounds obtained. 
For r < 2 the difference between the upper and lower bounds is always less than 
one, and, since m(k,t,r) is an integer, they provide an exact value: 


kré+r—1 
mk tr) = | EE oo, r= 1,2. (2.28) 
PROPOSITION 2.1. If k,t,p€ N andl <r < 2, then 
pk(p +r —1)* 
ee 1: 
me) |e ie 


moreover, if 


ka (Metre eke te — ee] 
Gir PaCS ie era aaa) 

[ae eRe eel) [ee tral ee } 

eer He See er ee OPS Aye ly? 


where 


_y fpk(ptr =r =) 
= payee 


i=1 
then n( K?; r) = mp(pk, pt; r). 


(For a vector K the notation K? denotes the vector with each component repeated 
p times.) 


PROOF. 
Mp(pk, pt; 7) > [p(pk, pt;r) = p(k, t; 1+ (r —1)/p) = 
k(pt+r—1)# r—1 pk(p+r— 1)! 
SS). aS =: IF 
(mre iS Geel) p (orl) Sraay 


which implies the required lower bound since m, is an integer. On the other hand, 
kK(p+r-ly) 

"trl (1 

consequently, the difference between the upper and lower bounds is strictly less than 


one if r < 2. 
For | = 1 we use the partition 


Oh r= pn (x:14 75") < 
p 


K = ([qJ,---5 [gaa], lanl, [gargil,---s[a]) F & 
in the upper bound, where {qv} = maxi<i<:{qi}. By Lemma 2.4, 1), we have 


M-1 M 
[(K-Q)=- De {qj} + (1. — {am}) — fam} (r —1) = 1-32 {a3} — {am} (r-1) = 


j=l 


= © {a} - {au} (r -1) <1 = {au }(r - 1). 


j=M41 
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In comparison with the lower bound in (2.26) this gives a difference less than one. 
Consequently, if / = 1, 


m(k,t;r) = oe ~. 1)t a oD {9} _ {qu }(r 1); (2.29) 
j=M+1 
or 
m(k,tir) = |q—— = tau r — 1)]- (2.30) 
r'—(r—1) 


Thus, for | = 1 the bound of complete packability minimizes the integral vector 
which is closest to Q in the metric of ¢,. In particular, this gives a complete solution 
of the minimization problem for t = 2. In the general case this phenomenon does 
not always hold; the smallest (with respect to t) example known is k = 422, t = 5, 
r = 5. For these parameters Q = (125.5355, 100.4284, 80.3427, 64.27416, 51.41933). 
The integral vector nearest to Q in the metric of 4; is X = (126, 101, 80, 64,51); 
for it ||X — Qlz, = 2.072346 and n(X;5) = 627. However, for the vector Y = 
(126, 100, 80, 64,52) we have ||Y —Q]|c, = 2.090435 and n(Y; 5) = 626. The smallest 
(with respect to k and r) example known is given by the parameters k = 76, t = 6, 
r= 3. For it, Q = (27.77143, 18.51429, 12.34286, 8.228571, 5.485715, 3.657143). The 
integral vector nearest to Q in the metric of @) is X = (28,19, 12,8,5,4), for which 
|X — Qlle, = 2.114285 and n(X;3) = 83. However, for Y = (28, 18,12,8,6,4) we 
have ||Y — Qe, = 2.171428 and n(Y;3) = 82. O 


THEOREM 2.4. The quantity m(k,t,r) is the smallest integer C' such that the 
recursively defined vector 
C+Hr-1-Soly 
pe eS Ald 
, 


satisfies the equation 
t 
Soy = Ki 
j=l 


We immediately note that by the above given bounds for m(k,t,7r), the possible 
choice of C' does not exceed the difference between these bounds, i.e. r—1. Since the 
sum yaar y; is monotone with respect to C’, this choice can be realized by operations 
whose number is log, r. 


PROOF. Clearly, always n(Y;r) < C, therefore we can put C' = m(k,t,r). If, 
moreover, ea = k, then n(Y;r) = m(k,t,r) (since m(k,t,r) is minimal). If 
Yy=1¥s > k, then, by reducing the necessary amount of components y; of Y by 

4-1 Yj — k, we obtain a partition of k having bound of complete packability at 
most, and hence equal to, m(k,t,r). 

In the alternative case 3; y; < k we consider a minimizing partition X = 
(x1,..-,24)  k which is ‘largest’ in the sense of the lexicographical ordering > on 
P,(k), and also consider the smallest 7 € [t] for which x; > y:. Clearly, such an 7 
exists always, and [i — 1] 4 @ since a; < [(m+r—1)/r]=y-1. 
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There is a j € [2 — 1] such that x; < y;, since otherwise (x; = y;, 7 = 1,...,i-1, 
xi > yi) we would have $95_, 2; + (r — 1)(a: — 1) > m(k,t,r). In fact, since in this 
case x; > y; + 1, we would have 


Yay t (= Yei-— > Ny + r— Dw Y +r > mkt), 


because nonsatisfaction of the last inequality is equivalent to 
m—-1—-Sizly; 


We now consider a j € [¢ — 1] such that z; < y,;, and estimate the bound of 
complete packability, n(X’;r), for the partition 


Dn NE (Uy, ay yp a Wy ye ee 


Let v be an index maximizing n(X’';r), i.e. 
n(X';r) = py as (imeest 8 | Cree 


If v > 4, then clearly n(X’;r) = n(X;r) = m(k,t,7r). 
If v = 7, then 


n =x, ie Ne Nea ea) vie 


w= 


= Yews + (r~1)(@,-1) -—r+2<n(X;r)-—r+2=m(k,t,r) —7r 4 2, 


which does not exceed m(k,t,r) for r > 2. 
If vu <7, then zi, < yy (w = 1,...,v). Thus, in this case too, 


n(X'sr) = 3 x, + (r— 1)(zi, — 1) < n(Y;r) < m(k,t,r). 


w=l1 


Thus, n(X’;r) < m(k,t,r), and hence n(X‘';r) = m(k,t,7r), i.e. the partition X’ 
is also minimizing. At the same time X’ > X, contradicting the maximality of X 
in the sense of the lexicographical ordering. O 


Let t(k,r) be the smallest ¢ for which there is a partition of k with ¢ parts that 
can be packed into every partition of k with r parts. 


COROLLARY 2.5. t(k,r) is the smallest t such that the recursively defined vector 


k+r-1-S2 1 Yi : 
Yi = Pill eer nem eA j Jew Weer 
T 


satisfies the equation 


t 
Speak 
j=1 
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In fact, this ¢ is the smallest root of the equation k = m(k,t,r). Here, the choice 
is again restricted by bounds available for t. These bounds follow from those for 
m(k,t,r): 


In(k +r —1) 
In(r) — In(r — 1) 


In(k + r — 1) —In(r - 1) 
In(r) — In(r — 1) 


2 t(k,r) > 


In fact, Theorem 2.4 does not only give the exact value of m(k,t,r), but also 
bounds t(k,r). More precisely, ift > t(k,r), then the number of nonzero components 
of the vector Y in the statement of Theorem 2.4 is exactly t(k,r). Clearly, if t > 
t(k,r), then m(k,t,r) = k. Thus, for t > t(k,r) this vector Y must have nonzero 
components. If in this case it is necessary to construct the partition p € P,(k) 
minimizing the bound of complete packability n(p;r) = k, then we can clearly take 
p to be any partition in P;(k) which can be packed into the vector Y, regarded as a 
partition. The lexicographically largest such partition can also be given recursively, 
and has, fori = 1,...,t¢, the form 


k+r—1—-Sii ky val 
k= min (| : yah k-S bats). 
r 


j=l 


7. Weighing 


‘Every sensitive body, of whatever shape, placed on balances in equi- 
librium with weights, in all positions absolutely retains some equilib- 
rium .... One often neglects or even rushes by simple and everyday 
phenomena which in the examination of nature give rein to great reve- 
lations, and one pre-accepts and searches for difficult tests, forgetting 
the once famous example, i.e. the simplicity and undoubtful mathe- 
matical fundament that every thing equals itself by quantity, to which 
almost all of mathematics appeals’. 


Such wrote M.V. Lomonosov in its work ‘Discussions on the rigidity and wateriness 
of bodies’. Here we turn to this ‘famous example’ of human activity: weighing. 

If, on a balance weight, we place a load of weight n in one pan and balance it by 
weights n,,...,m, in the other pan, then we have n = n; +---+n,. Since balance 
‘in all positions absolutely retains some equilibrium’, the order of the terms in this 
sum is unimportant in essence. Thus, the latter sum must be some partition of n. 

We now present ourselves with the real situation that we have to weight a load 
V by weights ky,...,k, with total weight k) + --- +k, > V, and we describe this 
situation analytically. If V is balanced by some of the k;, then 


V =k, +--+ ka, 


where 1 < i; <--- <a <t. Of course, we do not necessarily need all k; (1 <7 < t) 
to balance V. But what about the remaining weights k;,,,,,.-.,ki,, ie. those not 
taking part in balancing V? It is obvious that they also balance some load, to be 
precise k — V, i.e. together with the identity above another identity holds: 


ee ne ee ee 
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So, the system of equations thus obtained is a packing of the partition (k,..., kz) 
into the partition (V,k — V). Consequently, this apt relation between weighing and 
partitions and packability leads to the following facts. 


Fact 2.1. A system of weights k,...,k; of total weight k balances a load V 
exactly if and only if there is a packing (ky,...,k:) C (V,k—V). 


Fact 2.2. A system of weights k),...,k; of total weight k balances any integer 
load not heavier than k if and only if the partition (k1,...,k,) can be packed into 
every partition of k having two parts. 


We stress the meaning of these facts: in them real engineering phenomena are ad- 
equately described by only the abstract mathematical notions of number partitions 
and the binary relation of packability of partitions. 

The notions of partition and packability not only adequately describe an equilib- 
rium state, but also turn out to be useful in solving many weighing problems, among 
which we take as initial one the following old problem. 


Problem. How many weights does one minimally need to weigh any 
integer number of pounds from 1 to k? 


For k = 40 this problem is stated in ‘The Book of Abacus’ (1202 A.D.) by the Ital- 
ian merchant and mathematician Leonardo Pisano, whose nickname was Fibonacci 
(which means ‘Son of Bonacco’). 


Fibonacci travelled for a long time in the Orient, where he met with the Arab 
mathematicians Al-Khwarizmi, Abu Kamil, Mohammed-ben-Muza. In turn, they 
drew his attention to the mathematicians of India, notably Brahmagupta. The works 
‘The Book of Abacus’, ‘The Practice of Geometry’, ‘Algebra’, ‘Treatise on Square 
Numbers’ kept up and nourished the European mathematicians of the Middle Ages. 
They played an essential role in spreading the Arab ciphers. Thus, Leonardo reported 
on how one could write any number by using the 9 ciphers and zero, called the Arab 
ciphers. 


The name of zero, al zyfr (tefiro, tefro, zero), gave also the name of the signs 
themselves: zifrae, eifrae. 

Under his nickname he became immortal for the Fibonacci sequence 1, 1,2, 3,5, 8, 
13,..., in which the sum of two subsequent terms gives the next term. This sequence, 
which characterizes in Leonardo’s original treatise the fertility of rabbits, turned out 
to be far more fertile in mathematics than the phenomenon first described by it. 

Another Italian mathematician was drawn to this problem, Lucas Paccioli (1445- 
1514). The inscription on his tombstone in Sansepolgro, erected in 1878, reads: 


‘To Lucas Paccioli, who was a friend and contemporary of Leonardo da Vinci and 
Leon Batista Alberti, who first gave algebra its language and the structure of a sci- 
ence, who applied his great discoveries to geometry, invented double bookkeeping, 
and gave in mathematical treatises the basics and invariable norms for subsequent 
investigations’ 
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This list can be completed by yet another achievement: Paccioli undertook the 
first attempts at a geometrical modeling of writing. The well-known harmonic square 
of Leonardo da Vinci contains a human being; the harmonic square of Paccioli 
contains a letter. 

Paccioli proposed (1509) to construct letters on the basis of a square (each side 
of which was partitioned into nine parts). 

This problem is also encountered in the book ‘Applied and Remarkable Problems 
on Numbers’, Lyon, 1612. Its author, the Frenchman Claude Gaspar Bachet de 
Méziriac (1587-1638), was a remarkable person in many respects: poetry, mathe- 
matics, translator, publisher and commentator of the ‘Arithmetic’ of Diophantus 
of Alexandrié, the same edition in which the ‘margin was too small’ to contain 
the proof of the great Fermat theorem. The Russian translation of Bachet’s book 
‘Games and Problems Based on Mathematics’ appeared in St. Petersburg in 1877. 
Bachet had a preference for problems of Antiquity, as well as for those based on 
some real situations. The weighing problem is sometimes termed Bachet’s problem. 
A propos, it is Bachet to whom we owe a present-day text of the description of life 
in Aesop. 

An analytic approach to similar problems was proposed by Leonard Euler (1707— 
1783). His approach is based on the method of generating functions developed 
by him. In his ‘Analysis Infinitorum’, more precisely in the chapter on number 
partitions, Euler concludes from the easily established analytic identity 


(+2) 4¢a27) (lta (l4+e%))-=ltote?toertettar t+... 


‘that every nonnegative integral] number can be obtained from various terms of the 
double [i-e. with multiplier 2} geometrical progression 1, 2, 4,8, 16,32,... by addition, 
and moreover, in a unique way’. 

Consider this identity for an arbitrary finite case, i.e. cut off the multiplication at 
the lefthand side after the first four factors. The quantities x* (k = 0,1,2,...) at 
the righthand side of this identity can be obtained by multiplying out the braces at 
the lefthand side. Since the coefficient of each x* (k = 0,1,2,...) at the righthand 
side is 1, this means that each x* at the righthand side can be obtained in a unique 
way by multiplying out the braces at the lefthand side. So, in fact each k can be 
represented as the sum of certain powers of 2 and, moreover, in a unique way. 

In turn, Euler concludes from the identity 


(l+atal)(l+a° +x 4)(lta°+2%)---=l+zta +27 tarterta rt... 
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‘that all [integral] numbers can be obtained from terms of the triple geometrical 
progression by addition and subtraction, and moreover for each there is a unique 
manner’. 


L. Euler also mentioned a practical side of the problem: ‘It is known from the 
practice of weighing that if we have weights of 1,2, 4,8, 16,32, etc., pounds, then we 
can balance all loads with them, if we do not require parts of a pound to be balanced. 
Moreover, it is known from the practice of weighing that with an even smaller number 
of weights, growing in triple (i.e. with multiplier 3) geometrical progression, more 
precisely, of 1,3,9,27,81, etc. pounds, we can balance all loads, if we do not require 
parts to be balanced. However, if it is required to balance in this way we have to 
place weights not on a single pan but on both pans’. 

These facts were successfully used for a long time. Already the Ancient Egyptians 
used the first of them for multiplying numbers, and Luca Paccioli had zealously 
struggled with this manner of multiplication. In general, measuring practice has 
brought about various measuring scales, among which those proportional to 2 or 3 
are most widespread. However, among such scales we also encounter such that are, 
at first glance, not natural, e.g. the Fibonacci sequence. 

The very process of weighing has been accompanied at times by special regulations: 

‘Lifter (outdated) — a trading tax, collected when weighing commodities in certain 
cities. The naming arose because of the lifting of commodities on weighing instru- 
ments. The lifting tar was only due for heavy commodities sold or bought in large 
quantities, by both the buyer and the seller. The amount due to customs varied from 
one-half to five coins (Encyclopaedic Dictionary of Brokhaus and Efron).’ 


Euler took most actively part in the practice of weighing; so, he computed the 
amount of deviation of the beam of hand balances from being horizontal. In Russia, 
the 1797 law on measurements and balances prescribed to prepare balance weights of 
1 and 2 pud, 1,3, 9,27 pounds, and 1, 3, 9,27, and 81 zolotniks.! Thus, by and large 
a triple scale was specified. This testifies of the advanced nature of the measuring 
thought in Russia. Despite the large spread of special instructions to use such 
weights, this scale did not last long: one had already ‘catched’ an unknown integer 
scale of weighing by the triple scale for two-pan weighing. Already then the factor 
of computing in real time turned out to be practically significant. 

The great Russian chemist D.I. Mendeleev (1834-1907) headed in 1893 the Weights 
and Measure Office, and remained there as head until his death. He gave theoretical 
views on the construction of balances, as well as practical conclusions from them, in 
his work ‘On the expansion of gases’. Together with his companions and followers he 
gave intense attention also to a problem of ‘no little importance—the establishment 
of a rational number of balance weights’. The problem on weight is nowadays called 
the Bachet-Mendeleev problem. 

It can be imagined that after the reasoning of Euler the problem was exhaust- 
edly treated—to balance any integral load not heavier than k it suffices to take a 
smallest necessary amount of weights occurring in the double or triple geometrical 
progression. So, in the case of p-pan weighing (i.e. weighing for which the weights 


1A zolotnik is 1/96 of a (Russian) pound. 
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can be put in p pans, p = 1,2) the number of these weights will be a quantity of 
magnitude log,, ,(pk + 1), since the sum 


t-1 _ (pl) 1 
D 

must be at least k. For example, if k = 13, then for single-pan weighing we can use 
the weights (1, 2, 4,8), and for two-pan weighing the weights (1,3, 9). However, prac- 
tice turns out to be wider than theoretical conclusions: there are more economical 
systems of weights. So, to balance a load not heavier than 13 with single-pan weigh- 
ing we can use other systems of weights, e.g. (1,2,3,7) or (1,2, 4,6); they are more 
economical in the sense that their total sum equals 13, while 1+2+4+4+8 = 15 > 13. 
Why would we use systems of weights whose total sum exceeds the weight of the 
load to be weighted? Therefore the main question can be stated as 


1+(p+1)+(p+1)? +--+ + (p41) 


compute the least number of weights, t,(k), for p-pan weighing, and 
describe a system of weights, of total sum k, making it possible to 
weigh any integer load not heavier than k. 


The present state of answering this question is given in the following Theorem. 


THEOREM 2.5. Suppose all parameters occurring in this Theorem are integers. 
Let |x] be the integer part of a number x and |x| = [—a]. 

1. For p-pan weighing, a system of weights ki > --- > k, of total sum k makes 

it possible to exactly weigh any integer load not heavier than k if and only if 


t 
ki<1ltp >> ky, i=1,...,¢. 
j=it+l 
2. For p-pan weighing, the number t,(k), i.e. the least possible number of weights 
necessary to weigh any integer load not heavier than k, is given by 


tp(k) = }log,.,(pk + 1)|. 


3. For p-pan weighing, the smallest possible system of weights making it possible 
to weigh any integer load not heavier than k is recurrently given by 


ba = psn ke 
fy = [RE i=1,...,¢= t(k); 


for it we always have 


We will demonstrate this assertions of Theorem by a concrete example. Let k = 40 
and p = 1, i.e. we look for the smallest system of weights in single pan weighing. 
According to the second assertion of the Theorem, t,(40) = |log(41)[ = 6. By the 
third part of the Theorem we can recurrently obtain the weights: k, = [41/2] = 20, 
These weights, (20, 10,5, 3,1, 1), have sum 40. Can we use other systems of weights, 
e.g. (19,10,6,3,1,1)? The first part of the Theorem gives a positive answer to 
this, since it implies that, in particular, k; < (1+ p)’*,i=1,...,t. The fact that 
t2(40) = jlog,(81)| = log,(81) = 4 gives support to the assumption that we can allow 
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FIGURE 2.4. A drawing by M.V. Lomonosov 


FIGURE 2.5 


for two-pan weighing in the original statement of the problem, although Bachet only 
analyzed single pan weighing. 


Engineering problem. It 1s never asserted in the Theorem that p should only take 
the values 1,2. All formulas remain valid also for larger p. Try, ‘with the point of 
a pen’, to extract from these formulas balances constructed such that all formulas of 
the Theorem are realized for each natural number p. 


Euler’s words that ‘questions of this nature clearly exceed the means of analysis 
without these means’ remains true. Therefore the approach expounded above is 
directly related to the initial problem statements and gives the required information. 
At the basis of the approach there are extremal results, i.e. results describing only 
limiting admissible possibilities, and this turns out to be sufficient. The solution of 
extremal problems leads to the deep structural analysis of a real phenomenon, gives 
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a characterization and concrete realization of it, and, finally, describes general laws 
or an extremal law. 

As already noted, the relation between packability of partitions and weighing is 
based on the fact that the system of weights (k,...,k;) | k makes it possible to 
weigh, with single pan weighing, a load v < k if and only if (ky,...,k:) C (vu,k—v). 

Two-pan weighing can be reduced to single pan weighing: a system of weights 
(k,,...,k,) + k makes it possible to weigh, with two-pan weighing, a load v < k if 
and only if (k?,...,k?) C (k-—u,k +0). 

In fact, the equality v = Y5_, 6;k;, where e; € {0,1,—1}, is equivalent to the 
equality k + v = 4_, njk;, where nj = (ej + 1) € {0,1, 2}. 

Therefore, for p-pan weighing, a system of weights (kj,..., k;,) / & makes it possi- 
ble to exactly weigh any integer load not heavier than k if and only if the partition 
(kY,...,k?) can be packed into all partitions of pk of the form (k — v,(p— 1)k +), 
a eee 

By the principle of complete packability, it is now obvious that a partition (k7,..., k?) 
can be packed into all partitions of pk of the form (k—v, (p—1)k+v), v = 0,...,k, if 
and only if pk > n(kf,..., kP; 2), which, in turn, by (2.1) is equivalent to the system 
of inequalities 


t 
kj<lt+p >> ky < (p+), i= 1,...,¢. 
j=itl 
Consequently, the least number of weights, t,(k), needed for such p-pan weighing 
is the smallest number ¢ such that pk = m,(k,t,2), where 
my(kytyr) =, min n(k,...,Afir). 


Since 

mp(pk, pt, 2) > bp(pk, pt, 2) = pp (k,t,1 +1), 
we have k > p(k,t,1 + as and thus, by (2.8), (p + 1) > pk +1. Taking into 
account the upper bound for ¢, we find that t,(k) is given by the formula t,(k) = 
log, 41 (pk + 1) i and the weights may be given recursively by, e.g., 


pk+1—pxhy; . 
u=| ie 28 Oe on Dene (3 


There is also an explicit solution for p-pan weighing with unique weights; more 
precisely, for the smallest system of weights we may take 


x= ([Re feaes ees | | pk ) 
ree ate MMMM Oe Ge lO Oe a 6 9 el ad 


where ‘ k 1 
pk(p + 1)"* 
i= me} and t=ti(k); 
Here, {x} denotes the fractional part of the number 2g, i.e. {x} = x— [az]. For p > 2 
the meaning of these expressions is extremely simple, and they can be realized by 
weighing on balances which are special, but given by a natural construction. They 
are unequal-arm beam balances, having at the shorter arm a pan for weighing loads 
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only, and at the other arm p pans with mutual distances which are all equal to the 
length of the smaller arm. 

Comparing now the recurrently constructed smallest system of weights from part 
3 of the Theorem with the above system for t = t,(k), we see that they can be 
different: for k = 40 already, the last system has the form (20,8,6,3,2,1). This 
means that the problem of giving an explicit representation of all smallest systems 
of weights is yet still to be solved. 

Already when taking a fleeting glance at the above said there naturally arises the 
following question: For what weighing procedure can we know the exact values of 
m(k,t,r) for r > 2? In other words, the solution of what engineering problem would 
give us a computation of m(k,t,r) in full generality? Again, the answer can be seen 
from the following basic facts.The system of identities described above not only 
shows the packability (k,,...,k:) C (v,k—v), but also gives an analytic description 
of two simultaneous weighings of the loads v and k—v with two independent balance 
weights. This leads to the phenomenon of simultaneous (or parallel) weighing using 
different balance weight and the same system of weights (i.e. the weights taking part 
on one balance weight do not participate on the other). 

One of two worthwhile sellers has gone away and has given his weights to the 
remaining one, who must simultaneously serve the buyers: This is an illustration for 
parallel weighing on two balance weights. And so one can readily see the generality 
of the notion of packability of partitions brought about by the new real situation. 

While for single pan weighing and single weighing the sequence 


1,2, 4,8, 16, 32,... 


turns out to be most efficient, for two independent weighings its analog is the already 
somewhat more different sequence 


1,1,2,3,4,6,9,14,.... 


At first glance the construction law of this sequence is not clear. However, it can 
be clearly imagined; we only have to describe the conditions of independent weighing 
in terms of packability of partitions. Similar to the above we have 


- for single pan weighing with r — 1 independent balances, weighing loads 
(V1,.--,Ur-1) Hu < k by weights (k,,...,k:) & & can be realized if and only 
if there is a packing (ki,..., kt) C (v1,---, Up-1,k — v). 

- for single pan weighing with r—1 independent balances, weights (k,,...,k:) & 
k make it possible to simultaneously and exactly weigh any r—1 integer loads 
of total weight at most k if and only if the partition (k,,...,k,) can be packed 
into every partition of k with at most r parts. 

By the principle of complete packability, the latter holds if and only if k > 
WKissxa, kET), Or 


ky bo bh mpc Shs (be Nee nf 
j=l 
or 


> kj 2 (ki — 1)(r - 1), 


jai4l 
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or 
1 


r—| 


Sky > ki. 


j=itl 


Ieee 


An extremal solution of the latter system of inequalities is given by the se- 
quence 1,1, 2,3,4,6,9,14,.... 

All extremal characteristics for single pan parallel weighing can thus be computed 
from the extremal results obtained above. 

It is not without. interest to ask whether we can obtain this strange sequence for 
parallel weighing with two balances also by the method of generating functions. Or 
do we here encounter the limit of ‘the strength of analysis’ from the above-given 
quotation of Euler?However, the generating function for single pan two-dimensional 
weighing can be given sufficiently simply: 


(4+zrt¢yPlt2e?+yY)l+e2tyylt+a2% ty )(lt2%+y9)..., 


or, in the general case, 


[Ia + 2% + y™), 

i=1 
Similar to the second equation, Euler writes down the general form of the generating 
function for two-pan two-dimensional weighing (i.e. with two independent balances): 


[oe 

[[G + 2% +07°% + y™ +97). 

i=1 
However, an indication of the rapidly growing sequence {a;} in this general form 
has up till now been difficult. 

It is clear that such a model of weighing is equivalent to the realization of all 
outcomes in an allocation scheme. In particular, this implies that for fixed r and 
n, tending to infinity, the smallest possible number of groups that can realize all 
outcomes in the scheme of allocating n indistinguishable particles over r indistin- 
guishable boxes behaves asymptotically as 


In(n +r —1) 
In(r) —In(r — 1)’ 


with rest term (which never exceeds zero) of order 


In(r — 1) 
In(r) —In(r — 1) 


This system with minimal number of groups is given by the vector Y of Corollary 
2.4; 


If we allow two-pan weighings on all r — 1 balances, and (k;,...,k;) + k& realizes 
the simultaneous weighing, then rk > n(kj,..., kf; 7). 
In fact, in two-pan weighing on r—1 balances, if asystem of weights (k1,...,k:) Fk 


simultaneously weights r — 1 loads v,...,vp—-1 of total weight at most k, then the 
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system of inequalities 


t 
vj = DG ki, where j = 1,...,r—1, ei, =0,1,—1, 
t=1 


i 
Sle) <1, a ee 
i=l 

holds. Clearly, this system is equivalent to the system of inequalities 


t 
ku; = Do n;ki, where j = 1,...,7—1, m, = 1-4, = 0,1,2, 
i=1 


t 
So ll-m,| <1, PS ech 
t=1 


Thus, by Dirichlet’s principle, *_, nm, <r (t= 1,...,t). Therefore 

(ki,--.,ke) C(k—,..., 8 — Up, Rk +01 + +++ + Up_1), 
i.e. if the system of weights (k,,...,k;) / & provides simultaneous weighing of any 
r — | integer loads of total weight at most k on r — 1 balances, then the partition 


(ki, ..., kf) can be packed into all partitions of rk with r parts for which the (r—1)st 
part does not exceed k. By Theorem 2.3 this is equivalent to the inequality 


rk > nki,..., kf; bk"). 
But since 
Ni sciake kh’ ) Sk acca, ker) 
this is also equivalent to the inequality 
pee Re eather): 


Thus, if a system of weights (k1,..., k¢) k & provides simultaneous two-pan weigh- 
ing of any r — | integer loads of total weight at most k on r — 1 balances, then 


k; 
: eh eee a 
r—| , 


t 
k<1ltr > 


j=t41 
and for the smallest amount, ¢, of such weights we have the bound 


rkt+r—1 
t > logar—1)/(r-1) erm aa 


since n(ki,...,kf;r) = rn(ki,...,kf2—1/r) and, by Proposition 2.1, 


1\ k(2r —1)° r—1 
m(k41-3)=|_-“r - r 


In the case of simultaneous two-pan weighing on two balances the system of 
weights {]2*~'[}:-o.1,... is suitable. 

Of course, neither Lucas Paccioli, nor Bachet de Méziriac could have imagined 
that the weighing problem is in itself a computing technique-for fast raising to a 
power and with computer memory fragmentation. 
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In the process of functioning of the memory of a computing device there appears 
a separation between occupied and free parts: fragmentation. If it is then also 
necessary to enter new information into the computer’s memory, e.g. programs and 


array data, requiring amounts of memory k,,...,k;, then there arises a natural 
question: Can the latter be distributed over the fragments of free memory n1,...,N,? 
According to the principle of complete packability this can be done of S77_,n; > 
n(ky, ee Kt; r). 


Weighing makes its appearance into computing techniques, and even in explicit 
form. The arithmetical balances of Cassini, described in the ‘Collection on Devices 
of the (Paris) Academy of Sciences’ (up to 1699), are well known, as is the rising 
bridge constructed according to the system of general Poncelet in the fortification 
‘Mont Valerien’ near Paris. 


On the basis of weighing the numerist-diagrammetrist V.S. Kozlova developed a 
mechanical image of a computer with screen. On this invention, Fk. Lucas published 
his 1890 lecture at the Paresian National Museum of Arts and Trade, where, by the 
way, one of the two arithmometers for continuous movement of P.L. Chebyshev is 
kept. The general conclusion of FE. Lucas regarding the apparatus of V.S. Kozlova was 
as follows: ‘We think that this apparatus can be utilized, and it will be used in various 
forms, convenient to some or other requirements of experimenters. The present 
model of the diagrammeter is only the contemporary hull of the genial idea of Mister 
Kozlova. I also think that it would be more convenient to replace the hand balances by 
spring balances. On this apparatus we can obtain the formulas of Simpson, Poncelet, 
and general Parmantet, and, in general, all quadrature formulas’. 


Mendeleev noted that ‘in the nature of measurement and balances there are prin- 
cipal instruments of knowledge’, that is why all ‘notation’ scales are important, i.e. 
allowing one to precisely measure or determine a large amount of unknowns. 

A sequence ky < kg < ... and a natural number p are called a scale of no- 
tation if each natural number n can be written in the form n = >;3, ek; with 
e, € {0,1,...,p}. Verify that ({ki},p) is a scale of notation if and only if k; < 
Lp hie = ees 


Try to analyze the Fibonacci sequence as a scale of notation. In solving this question 
some unexpected surprises will await you. 


The above-mentioned Fibonacci representation of numbers by Arab ciphers and 
zero follows from this too. For the notation of numbers one also uses the well-known 
Roman ciphers. A distinguished economy of the Roman system is: to write down 
the first 39 numbers one uses only the 3 ciphers J, V, and X; and for 40 a fourth 
one: L. This leads to the thought that the Roman system can be characterized by 
two-pan weighing Try to do this on your own. 

These examples show that the results on weighing obtained above can be trans- 
ferred to various real phenomena related with measuring collections. According to 
Euler, ‘a constant. collection is a definite collection, always preserving one and the 
same value. Such collections are numbers, because they preserve one and the same, 
once-received value’. Therefore one measures and computes collections by numbers, 
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of which historically the first was the notion of natural number. ‘All numbers as we 
know consist of a certain amount of the unit’; following Euclid, this was the defini- 
tion given by Diophantus of Alexandrié in his ‘Arithmetic’. It can be traced back to 
Plato and Aristoteles. The Ancient Greeks ascribed the invention of numbers to the 
legendary Prometeus, son of Femida, goddess of justice, ordinarily depicted with a 
balance. 
We end this Section on the ‘everyday phenomenon’ of weighing with the following 
‘dialog’: 
‘... with the course of time science grows up, and there is the need to 
combine all that was separate before ... there is the need to change 
the anatomical study of a subject by a physiological’. (S.M. Solov’ev) 


‘But there has to flow yet quite some water in the Rhine before schools, 
finally, discover that mathematics can be a humanitarian science and 
that students can understand Euler equally well as they do Plato and 
Goethe.’ (Andreas Speizer) 


8. Lattice structures in the refinement order of number partitions! 


As noted before, there are several partial orders on the set of partitions: lexico- 
graphic, power set, refinement, partial sums, concatenation. The refinement order 
on number partitions is perhaps the most natural, although little is known about 
it. The few authors who have investigated this order structure, have insisted that 
its ‘non-lattice’ character is the primary drawback to progress (cf. [198], [151]), 
for, among other consequences, the computation of its Mobius function becomes 
unwieldy. Among the few positive results, the authors of [127] have enumerated 
the number of maximal chains in the refinement order. These observations notwith- 
standing, our aim in this Section is to show that the refinement order actually carries 
much lattice structure, even if, as a whole, it is not itself a lattice. 

The refinement order on the set P(n) of all number partitions of n is the transitive 
closure of the following successor relation: m,m2...™Mp < mm ...m,_,, where the 
set of parts m;, coincides with that of the parts m; except for one part m= m; + Mx 
for certain j,k € {1,2,...,p}. We illustrate the upward drawing of P(6) in Figure 
2.6; notice that it is not a lattice. 

Closely related is the set II(n) of all set partitzons {S;: i € I} of an n-element 
set {@1,...,@n}. The S;’s are the blocks of the set partition; the type of the set 
partition is the nondecreasing sequence of positive integers corresponding to the 
sizes of the blocks arranged in nondecreasing order. The set partitions are ordered 
by: {Si: 7€ I} < (Sj: 7 € J} if, for each i € J, there is a j € J such that S; C Si. 
This (refinement) order turns II(n) into a lattice, the much studied set partition 
lattice. Much is known about its algebraic structure (cf. [201]), its Mobius function 
is known [200], and like the refinement order for number partitions, a precise formula 
is known for the number of maximal chains between the top and the bottom. II(4) 
is illustrated in Figure 2.7. 

There are at least two other interesting links between the refinement order on 
number partitions and the partition lattice. 


2This Section was written by Ivan Rival (Dept. Computer Sc. Univ. of Ottawa, Canada) and B. Stechkin 
(Steklov Math. Inst. of Moscow, Russia), February 18, 1992. 
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FIGURE 2.6. The refinement order of P(6) 


arbicid 


FIGURE 2.7. The refinement order of II(4) on the set of all set partitions of a 
4-element set and the corresponding refinement order of P(4) on the collection of 
its set. partition types. 
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In the first place it is obvious that the types of set partitions are themselves 
number partitions. We consider an order on the collection of set partition types: 
m,M2--+ < mm‘... if there exist corresponding comparable set partitions {5;: 
i= 1,2,...} < {S): 7 = 1,2,...} in II(m). (Equivalently, this is the order induced 
by II(n) on the orbits of its automorphisms.) This natural relation between II(n) 
and P(n) has been noted by several authors (cf. [151], [79]). We illustrate this for 
n = Ain Figure 2.7. Of course, P(n) is not order embeddable in II(n): although the 
lengths of the two orders are identical, P(n) is not a lattice, for every n > 5. 

The second link is not as obvious. It is the substance of our first result, a connec- 
tion between arithmetic and lattice structure. It is inspired by the simple observation 
that number partitions with increasing parts sufficiently far apart behave much as 
set partitions. 


THEOREM 2.6. (i) Forn = p= 5, the up set of all number partitions above 
M,...Mp in P(n) is a lattice if and only if Viermi A Njez™m,; for each 
I,J c {1,2,...,p} such that IN J =9@ and |IUJ| <p. 

(ii) Moreover, in this case, the up set above m,...™Mp in the refinement order of 
P(n) is isomorphic to the set partition lattice II(p) on a p-element set. 

(iii) II(logn) is the largest set partition lattice isomorphic to an up set in the 
refinement order of P(n). 


Thus, the number partitions above 1234 in P(10) will not form a lattice, although 
the number partitions above 1248 in P(15), or even above 1247 in P(14), form a 
lattice. The refinement order on number partitions also contains large down sets 
with lattice structure. 


PROPOSITION 2.2. There is a number partition with |n/3| parts such that the 
down set of all number partitions below it, in P(n), is a distributive lattice of length 
|(2/3)n]. Moreover, for n > 6, |(2/3)n| is the length of the longest distributive 
lattice tsomorphic to a down set in P(n). 


Up till now, the calculation of the Mobius function for the refinement order of 
number partitions has remained virtually inaccessible. At the same time its calcu- 
lation, on distributive lattices and on set partition lattices, is fully understood. It 
follows, therefore, from Theorem 2.6 and from Proposition 2.2 that, at least on large 
intervals of the refinement order for number partitions, we have a full understanding 
of the Mobius function. 

So much for the lattice structure of down sets and up sets in P(n). 

Although P(n) is itself not a lattice for every n > 5, there is a ‘regularity’ to its 
upward drawings (cf. Figure 2.9) which suggested to us the idea to delete compa- 
rabilities to obtain lattice structure. And, indeed, there is a natural way to define 
such an order. 

We define the concatenation refinement order or C-order C(n) on the set of all 
number partitions of n as the transitive closure of this successor relation: 


1 aap SF AT a 
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FIGURE 2.8. The C-orders C(6) and C(10) 


if there is an 2 such that 


m, aol M5 + M541 it 7 1, 
Mj+1 ifj>i=. 


The difference with the refinement order lies in the following two distinctive features 
of the C-order: 


(i) any successor of a number partition is constructed by summing consecutive 
terms, 
(ii) this new summand satisfies the inequality m; + mii < Mize. 


This inequality condition on the summand suggests a ‘Fibonacci’ property. 

It is convenient to examine the structure of a lattice by identifying its supremum- 
and infimum-irreducible elements for, as is well known, the subset of these elements 
determines the lattice. As the elements of the C-order are, in the first place, explic- 
itly described, it is a relatively light matter to describe its irreducibles. For instance, 
if a number partition m,...m, has parts m;,m;, 77 j, such that 


2mM-1 <M < Miz1 


and 
2mj-1 <M; < M41, 
then it is supremum irreducible in this C-order. 

Although the upward drawing of the refinement order would seem too cluttered 
to be of any use in ‘reading’ the order, this C-order is ‘weak’ enough for its upward 
drawing to be constructed with little difficulty and to be actually ‘read’. For in- 
stance, in Figure 2.8, we illustrate the C-order for n = 10 and, for convenience, we 
label only its union-irreducible and intersection-irreducible elements. In Figure 2.9, 
we illustrate the refinement order of number partitions for n = 9. Its elements are 
traced from their corresponding coordinate positions in Figure 2.8, and its labels 
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FIGURE 2.9. The upward drawing of P(9) 


can, thereby, be readily reconstructed from the labels in Figure 2.8 too, as the down 
set of all number partitions beginning with the part 1. 


THEOREM 2.7. The C-order on the set of all number partitions is a lattice. 


Besides the improved prospects for investigating the order structure of number 
partitions using these upward drawings, the C-order leads surprisingly to an upper 
bound on the number p,(n) of all number partitions of n with r parts. It is well 


known that 
lfn-1 
pr(n) = *( 
ri\r—1 


and, for r = o(n/%), this is (asymptotically) equality |199]. 


THEOREM 2.8. For any positive integers r <n, 


Pr(n) < (ae 


8.1. Proofs of the results. 


PROOF OF THEOREM 2.6. It is easy to verify that P(n) is a lattice for n = 4 
and for p = 1,2,3. (There are too few parts in these number partitions to produce 
much structure at all!) The smallest ordered set with top and bottom which is not 
a lattice is the 6-element ordered set illustrated in Figure 2.10. If p = 4 and all 
four parts are identical, then the up set above m,;m2m3mMz is still a lattice (such a 
partition has precisely one upper cover). If p = 4 but there are just two unequal 
parts, the up set is still a lattice (its upper cover has at most one number partition 
with three distinct parts). If p = 4 and there are at least three distinct parts, we 
claim that the assertion of the theorem holds too. 

Let m,...m, € P(n) satisfy Vie mi F jez mM, for every I,J C {1,...,p} such 
that 1Q J = @ and |I UJ| < p. As there are no duplicate proper disjoint partial 
sums, addition of numbers plays the same role as juxtaposition of these numbers as 
digits. In this sense, the refinement order on the set of all number partitions, above 
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FIGURE 2.10. The smallest non-lattice with top and bottom 


mM ...™Mp in P(n), is precisely the same as the refinement order on the set of all set 
partitions on the p-element set {7m ,...,™p}. 

Let us suppose now that there exist disjoint subsets /,J of {1,...,p}, whose 
union is not all of {1,...,p}, such that Dieym:i = Njeym;. We show, by distin- 
guishing cases, that under these hypotheses P(n) contains a cover-preserving subset 
isomorphic to the ordered set illustrated in Figure 2.10 (its bottom being a number 
partition with four parts), whence P(n) itself cannot be a lattice. We first consider 
the case |I| > 1, say to € J, 


MI —io a > M4, 
i€l—{io} 


my = Sm. 


jed 
As IUJ # {1,...,p}, we can consider also 
mK = > MK. 
kE{1,....p}-IUJ 

Now take the number partitions 

Mig + Mr, My, My-io 
and 

My+MK, Mig, My-ip- 


They are distinct: my # mj; + mx and, since m, + My_-i, == Mz, we have m; £ 
Mi, My_i,. They are noncomparable in P(n), for they both have three parts. Each of 
these number partitions covers m;,,™~i,, My, mK. Each of these number partitions 
is covered by the number partitions 


Mig + My-ig + MK, MI (- Mig + MI~in = ym) 
i€l 
and 
My-in, (Mig +mK)+my (= (my +mMxK) + mi). 
These are distinct for mz > m;_;,, and my < my + mK + ™mj,. Since both have two 
parts, they must be noncomparable. Each, in turn, is covered by top (= n). This 
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6-element cover-preserving subset of P(n) is isomorphic to the non-lattice in Figure 
2.10, so, in this case, P(n) is not a lattice. 

We may suppose now that m,...m, € P(n) satisfies Viepm; = Vyjes my, for 
some I,J C {1,...,p} such that 1A J = 0 and |J U J| < p, only in the case 
|7| = |J| = 1. Say m; = m;, 2 # 7. Suppose there are two other parts mz, # Mx, - 
Let mn = n— (m+ mj; + m,,). Consider the two number partitions 


™My;, mM; + Mk 5 MK 


and 
Mi, Mi + MK, Me. 


They are distinct: 

m5 af Mko Zz Mi, Mm; + Mk, kg - 
They are noncomparable in P(n), for they both have three parts. Each of these 
number partitions covers the number partition m;,™m,;,7Mx,, Mx. Each has both 


Mi, Miz + Me, + MK 


and 

Mmtm, (=M;+mM,), Mt+mMm« (=m+mMx) 
as upper covers. These upper covers are distinct for m; + me, + MK # m; + 
Mkg,™M%; + MK and noncomparable, for each has precisely two parts. These four 
number partitions, along with bottom = mi,mj;,m,z,,mx and top = n form a 6- 
element cover-preserving lattice in P(n). 

If all other parts mz, k # i,j, of this number partition m,...mp, are identical, 
My = My for all k,k’ € {1,...,p} \ {2,7}, then take one part m,, and, for another 
part, take the partial sum of the remaining terms mx = Y),4i,j4,7™r- From the 
hypothesis that p > 5 we deduce that m,, # mx and again we can construct a 
cover-preserving subset of P(n) which is not a lattice: bottom = mi,m,;,™Mk.,Mk, 
covered by each of the number partitions 


Mi, Ms + Me, MK 


and 
Mm, mM; ce MK , ko - 
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In turn, these are both covered by each of 
mi, ™; + TM ko + MK 


and 
m+ mz, (= mMj+M~), Mi +MK (=m, + mx) 
and top = n. This completes the proofs of (i) and (ii). 

We turn finally to the proof of (iii). First of all, to see that P(n) does contain an up 
set isomorphic to II(log n), just take the number partition consisting of the positive 
integers 1,2,2?,...,2°,... which satisfies the conditions set out in (i). (Compare 
this with the Bachet problem [152].) 

Let ¢ stand for the lowest level (equivalently, the greatest number of parts) of a 
number partition of n whose up set in P(n) is isomorphic to the set partition lattice 
II(t). Notice that P(n) has |n/2] number partitions with two parts, while II(t) has 
2'-! _ ] set partitions (of a t-element set) with two parts. As II(t) is an up set in 
P(n), we conclude that 

i-1 a 
m< IF 


Another proof of (iii) can be fashioned on this interesting idea. Label the singleton 
subsets of the hypercube 2‘ by the consecutive ¢ parts of the number partition whose 
up set in /?(n) is isomorphic to II(t). Label an arbitrary element. of this hypercube 
by the ‘non-repeating’ partial sums of the (singleton) parts that lie beneath it in 2°. 
According to the condition (i) this is well-defined, and the sum (n) of all the parts 
cannot exceed the top element of this t-dimensional hypercube. 


which completes the proof. O 


PROOF OF PROPOSITION 2.2. Let m),...™m, be a number partition of n whose 
down set in P(n) is a lattice. By inspection (cf. Figure 2.6 taking the number of 
partitions with at least one part equal to 1) P(5) is not a lattice and, as P(m) is 
isomorphic to a down set of P(n), for every m < n it follows that m; < 4 for all 
a= 1,2,...,p. If the number partition has at least one part m; = 4, then every 
other part must be | for, if even one part were 2, then the down set would contain 
a subset isomorphic to the down set of all number partitions below 24 in P(6)(cf. 
again Figure 2.6); thus, in this case, p > n — 3 > |(2/3)n], and, it is easy to verify 
that the lattice is distributive. Indeed, it is the linear sum of a chain of length 
n—3 and a copy of 2?, the two-dimensional hypercube (all of which is a distributive 
lattice). 

We may suppose then that the largest part is 3. We can verify that the down 
set of number partitions below any number partition 1'2/3* is itself a lattice. In 
fact, such a number partition has precisely two lower covers: 1'+?2/-13* (provided 
jy > 1) and 1°+'27+13*-1 (provided k > 1). This lattice is the linear sum of the 
singleton chain, |n/3|— 1 copies of 2?, and another singleton chain (provided n = 3 
(mod 1)), which, of course, is distributive, and has length |(2/3)n|. This completes 
the proof. 


PROOF OF THEOREM 2.7. The C-order C(n) is, of course, contained in the re- 
finement order of P(n), that is, if number partitions m,...™mMp < mj...mj, in C(n), 


then m,...Mp < m{...mj, in P(n). Moreover, the size of every maximal chain 
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623? 333 


1723 


FIGURE 2.12 


in C(n) is identical, and identical to the size of every maximal chain in P(n), for 
between any two comparable number partitions of types p < q there is always a 
number partition of type p + 1, both in C(n) and in P(n). Thus, every covering 
edge (that is, every edge in the upward drawing) in C(n) is actually a covering edge 
in P(n). 

It is convenient to verify the lattice property using the following criterion: a finite 
ordered set with top and bottom is a lattice if and only if every pair of elements 
covering a common element has supremum |197]. To this end we observe that C(n) 
has bottom = 1” and top = n. 

Let m,...m, € C(n). All upper covers have type p — 1; indeed, for a distinct 
pair of upper covers there must be indices 1 < i < 7 < psuch that 


Mi +t Mi¢, LS Mize 
and 


M5 + M41 S Mj 42. 


A common upper bound of the number partitions 

My, ...™Mj-17; + Miz} +++ Mp 
and 

My... ™M;_1™MN; af M41 +++Mp 


will have, in the ith place, a part at least m; + mj, and, in the jth place, a part at 
least m; +m 41. If 2+ 1 <j, then the number partition 


My... Mg + Mig, ©... MG + M541... Mp 


is such an upper bound, of type p — 2 and must, therefore, be the supremum of the 
prescribed pair of upper covers of ™...7p. 
Let us suppose, then, that7+1= 7. If p>71+3 and 
Mi + Miz, + Mize < Miz3 
then 
my, ...™; + Miz, + M42... Mp 
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is a common upper bound of the number partitions covering m,...™m, and, as such, 
has type p— 2: it is the required supremum. If p = 7+ 2, then m...m;+misiMi4e2 
is the required supremum. Thus, we suppose that p > 7+ 3 and mi + mi41 + Miz2 > 
miyz3. If p= 27+ 3, then my...m + Miz) + Mize + Miy3 Must be the supremum. 
Thus, suppose that p >2 + 3. 

Let m} ...mj, be the minimal common upper bound of the prescribed pair of upper 
covers of m,...™mp,. Such an upper bound satisfies m = mj, mi = me,...,mi_1 = 
Mi-1, and 

m, > mM; + Miz, + Mize. 

We give a procedure for the construction of the supremum by considering succes- 
sively the additive conditions on consecutive terms required for comparability in the 
C-order. In this way we impose successive conditions on the minimal common upper 
bound which, in the end, determine it uniquely, thus giving the required supremum. 
As mj + Miz, < Mize and Mj = Miz) + Mize < Mizs, it follows that 


Mi + Miz, + Mize < Miz3g + Mizg. 


If p=i4 4, then m,...mj + mis1 + MizoMiz3 + Miya is the supremum. Otherwise 
p>it sd. Then miz3 + Miza < Mizs, and 


My... + Migi + MiqeMi43 + Mip4g... Mp 


is the supremum. Otherwise mi43+™Miy4 > M45. Then p > i+6 and miy3+Miz4 < 
Mit5s + Mize. 

Continuing in this way we derive necessary additive conditions producing a num- 
ber partition which, in C(n), is a common upper bound. The procedure terminates 
by producing the supremum of the two upper covers of m,...m , in C(n) and this 
completes the proof. 


PROOF OF THEOREM 2.8. Let t(n,r) stand for the smallest integer ¢ for which 
there is a number partition m, ...m;, such that, for every number partition mj ...m{. 
with r parts, m,...m, < m,...mj, in the C-order C(n). 


We show first that 
—1 
AG OP ee pa ed. 
T 


To see this, observe that the largest part m/, of m{...m{. satisfies 


Z|) <misn-r th. 
T 


Then the largest part m, of mm2...m, satisfies 


mi < H . 
r 
Now, m}. = Dizi) Mi and, as long as m,, < n—r + 1, there is a number partition 
with r parts whose largest part has value mj, + 1. It follows that mf. = Di scm 41) Ms 
where i(mj. + 1) < i(m,). In particular, i(m), + 1) = i(m,) — 1 and mim:41) = 1. So 
m,; = 1 for all 7 < i(m{. + 1) As the largest part of a number partition of P.(n) can 
be as small as |[n/r| we conclude that the required number partition m...m, has 
the precise form 1'~!|n/r|. We deduce that t < [n(r —1)/r] +1. 


96 2. EXTREMAL PROBLEMS ON PACKABILITY OF NUMBER PARTITIONS 


Next we show that there is an order embedding of the subset of all number parti- 
tions above m,...™; in C(n) into the (t — 1)-dimensional hypercube 2‘~!. To this 
end we associate t-tuples of zeros and ones to this up set of number partitions, as 
follows: m,...m; is associated with the zero sequence (0,...,0) and a number par- 
tition m}.. 7m, > m,...7™ has a | in the zth place if in constructing this number 
partition from m,...m, the sum m,; + ™m;4) needs to be taken (otherwise 0 in this 
place). This a one-to-one, order-preserving, and height-preserving map of this up 
set into 2!}. 

We can now complete the proof. The set P.(n) of number partitions, each of type 
r,is an antichain (level) in the up set above m, ...m, in C(n), and this antichain, in 
turn, is associated with a subset of the (t—r)th level of the corresponding hypercube 
2‘-!. Tn conclusion, 


Ss = peel) 
mins (tt) = (tot) (FN, Oo 


9. Problems and assertions 


In this section we give some well-known results (in the form of problems and 
assertions requiring proofs) and open problems, which are marked by (?). Try to 
prove these results and to answer the questions posed. 


PROBLEM 2.1. Suppose you stay in a Russian hotel. What is the least amount 
of banknotes (with values Rbls 1, 3, 5, 10, and 25) you will need to pay a cleaning 
lady, a nurse, a laundress, and a plumber if your total bill for all services is Rbl 37? 
How to settle the problem if the ladies will ask for Rbls 3, 5 and 5, respectively? 


PROBLEM 2.2. A partition (k’) can be packed into a partition (n,,...,n,) if and 
only if 


Try to consider the dual problem: what partitions (k,,...,k,) can be packed into 
a partition (n")? 


PROBLEM 2.3. Let na(k,t,r) be the largest n for which there is no partition of 
a number k with ¢ parts that can be packed in any partition of a number n with r 
parts. Then 
na(k,t,r) = max{k-—1,k-—1—t4+r}. 
PROBLEM 2.4. If n,(k,t,r) is the smallest n for which Vp € P,(n) Vq € Pi(k) : 
p Z q, then n(k,t,r) = max{k+1,k+1—t4+r}. 


PROBLEM 2.5. Let n-(q1,---,@37) be the smallest n such that no partition of n 
with r parts can be packed into the partition (q1,...,q@), 91 >--: > q. Then 


min(r,t) 
neQs---.gsr)=1+ Do 4H. 
i=l 
PROBLEM 2.6. If g € P,(k) and na(q;r) is the largest n for which a partition of q 
cannot be packed in any partition of n with r parts, then ng(q;r) = max{k— 1,k— 
Ptr}. 
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PROBLEM 2.7. If ne(k,t,r) is the smallest n for which Vp € P;(n) Vq € P;(k) : 
q C p, then ne(k,t,r) = max{k,r(k — t) + 1}. 


PROBLEM 2.8. If ns(k,t,r) is the largest n for which Vp € P:(n)Vq € Pi(k): pc 
q, then n;(k,t,r) = min{k, |k/t| + r — 1}. 


One of the most fundamental open problems in the theme of extremal number 
partitions is the following. 


PROBLEM 2.9. (?) For given m1,...,m,r, ¢, compute k(m,...,n,;t), the largest k 
for which V(k,...,k,) / & there holds the packing (k),..., kj) C (m,..., Mr). 
PROBLEM 2.10. A partition (ki,...,k,) can be packed into every partition in 


P,(n) if and only if the partition (/k,,...,lk,;) can be packed into every partition in 
P,(nl + (r — 1)(1 — 1)). 

PROBLEM 2.11. A partition (k),...,k,) can be packed into every partition in 
P,(n) if and only if the partition (k{,...,k/) can be packed into every partition in 
Pat4i(nl). 

PROBLEM 2.12. If a partition (k,,...,k,) can be packed into every partition in 
P,(n), then the partition (k{,...,k!) can be packed into every partition in P.(nl + 
(r — 1)( — 1)). 

PROBLEM 2.13. If, for natural numbers r, t, ny,...,N-,k; > --: > kj the inequal- 
ity 

r ni a 

; at ms a2 > =f ae 

max (>! | + (r—1)(l | 2 max {oh + (r — 1)(k; o| 


Ik; 
holds, then the partition (k,,...,k,) can be packed into the partition (n,...,n,). 
PROBLEM 2.14. If p(ki,...,ki;r) is the smallest p such that (ky,...,k:) C (p"), 


then 
n(ky,..., kis) 
eee carnal |: 


p(ki,..., ker) < 


PROBLEM 2.15. If M(k,t,r) is the largest M such that Vq € P(k) dp € P,(M) : 
q £ p, then for sufficiently large r, 


mitrr= ([ptg] +2) ]Ef 


PROBLEM 2.16. If N(k,t,r) is the smallest N such that Vp € P.(N) dq € P,(k): 
p cq, then 


N(t,k,r) =k+ (|£|- 7 max{0,r—t}. 


PROBLEM 2.17. For any allocation of n indistinguishable particles over r indistin- 
guishable boxes there are r groups of particles (with n/(2r) particles in each group) 
lying completely in the boxes. 


PROBLEM 2.18. For any allocation of n indistinguishable particles over r indis- 
tinguishable boxes there are t groups of particles (with [(n + r— 1)/(t + r — 1)] 
particles in each group) lying completely in the boxes. 
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PROBLEM 2.19. If k = n(ki,...,k¢;r), then every allocation of k particles over 
r boxes can be realized by ¢ groups of particles with k; particles in group j (7 = 
1,...,¢), under the condition that each group lies completely in a single box. 


PROBLEM 2.20. If the inequalities n, > --- > n, hold for a partition (nj,...,n,) F 
n, then 


{(pi,..+,Pr) € P,(n) : DE Tes (i aware i CS 
CA Wiest Pe) SF) iy ype) Sia), 


{(p1,.--;Pe) © Bn): pi Sn, §=1,.0.47 — TC 
Ct issss. De) CRAG) > yews) & ies}. 


PROBLEM 2.21. Counter service problem. In some city a cooperative confection 
bar is opened with one long counter. It is known that civilians enter the bar by 
family (with k; persons in family 7), and that there are t families. Each family is 
seated at the counter, taking seats in order and not leaving empty seats at both 
sides simultaneously. Being seated, those who wish may leave, but may come in 
anew. How many seats must the counter have to prevent queues to form? 

If the cooperation includes an administer who seats the guests to his own prefer- 
ences (not separating families), then how many seats can be spared by him? 


PROBLEM 2.22. Transportation problem. Suppose we have p warehouses, in each 
of which m,; units of goods of a single kind are kept (i = 1,...,p). Suppose also 
we have qg customers, each of which can use n; units of goods from the warehouses 
(7 = 1,...,q), and suppose, for the sake of being specific, that m, +--+ +m, = 
nm, +++: +Mq =n. What is the smallest amount of transports necessary to bring all 
goods from the warehouses to the customers? A single transport is assumed to take 
a good from one warehouse to a customer. 


PROBLEM 2.23. For a natural number n we let s(n) be the number of natural 
numbers m (m > 2) for which m — 1 divides [n(m — 1)/m] without remainder, and 
we let d(n) be the number of divisors of n. Show that n — 1 is a prime if a nd only 
if d(n) = s(n). Show that n — 1 and n + | are both primes (‘twins’) if and only if 
2d(n) = s(n) + s(n + 1). 

Hint. Prove and use the equation d(n — 1) + d(n) = s(n) + 2. 


PROBLEM 2.24. Show that the recurrence kj = ki, + ki-e (ky = 1, kp = 2) and 
the recurrence 


define one and the same sequence. 
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PROBLEM 2.25. Try to verify for the Frobenius numbers s = s(n,n + 1,n + p) 
(p > 1) that ifn > p(p-— 4) + lort(p—-1)-2<n<tp+1,0<t< p-—3, then 


ntl 


S= 1p = 3) Flin sy-14@-H] 


n 0, n =p-—1 (mod p), 
=n{(p—24+ |-]]-1- 2 
p n—p|2], n#p—1 (mod p). 
For the latest state of the solution of the Frobenius problem we refer to work of 


I. Kan in the Russian journal ‘Discrete Mathematics’ a 


PROBLEM 2.26. Show that ifn =i (mod p),i=0,...,p—1, and f(n) = zi, then 


10 a(t) 


(This useful fact was also noted by I. Kan.) 


PROBLEM 2.27. Draw the Hasse diagram of P2, and try to compute its Mdbius 
function. 


CHAPTER 3 


Extremal problems on graphs and systems of sets 


This chapter, which is devoted to extremal problems on graphs and systems of sets, 
seems at first glance unrelated with the material of the previous chapter, which was 
concerned with extremal problems on number partitions. However, it turns out that 
a whole class of problems on extremal properties of graphs (more precisely, the class 
of problems on local properties of graphs) can in essence be reduced to extremal 
problems on number partitions, and moreover precisely to such problems as have 
been considered and solved in Chapter 2. 

We introduce the necessary notation. By S, = {a1,...,@n}, and S = {a,...}, we 
denote the indexed, respectively nonindexed, set of vertices of graphs or hypergraphs. 
A subset of vertices is said to be independent in the graph if no pair of vertices in 
this subset is joined by an edge of this graph. 

By G?(S,) we denote an arbitrary graph with vertex set S,, and by G2 a graph 
on a certain set of n vertices; thus, G?(S,) C C?(S,) (for the definition of C*(S,) 
see Chapter 1, §1.4). We also use the following notations for special kinds of graphs: 


K,, is the complete graph on n vertices, i.e. K, = C?(S)); 

Ky q is the complete bipartite graph on two vertex sets (the parts) with p 
and q vertices in each part, respectively; so,, if S$, 5S, = 0, then Kpg = 
CNS.) C*(S,); 

Zn is a star, i.e. a bipartite graph with one singleton vertex part (Zp = Kin); 
G = C?(S)\G is the graph complementary to the graph G on the vertex set 
S. For example, the graph complementary to the complete bipartite graph 
Kpq is the graph K, + Kg (the graph consisting of two complete subgraphs 
on two disjoint subsets with p and q vertices, respectively). 

For S Cc S, we denote by G(S) the subgraph of G(S,) induced (or generated) 
by the vertex subset S, i.e. consisting of precisely those edges of G(S,) that join 
vertices from S; so, 

G(S) = G(S,,):0.C7(S). 

Two edges of a graph are said to be independent if they nonadjacent, i.e. do not 
have common vertices. A system of pairwise independent edges is called a pairing. 

F, denotes a k-vertex graph with [k/2] independent edges (a pairing); 

Fj, is a pairing with a ‘fork’, ie. a k-vertex graph with |k/2/ independent (if 
possible) edges; 
C; is a simple cycle on k vertices; 


101 


102 3. EXTREMAL PROBLEMS ON GRAPHS AND SYSTEMS OF SETS 


P, is a simple path on k vertices. 


The chromatic number x(G(S,)) of an n-vertex graph G(S,) is defined as the 
smallest possible amount of colors with which the vertices of S, can be colored such 
that vertices joined by an edge in G(S,) are given different colors. In other words, 
x(G(S;,)) is the smallest integer . for which there is a map ¢: S, — [x] = {1,...,x} 
such that for each ‘color’ i € [y] its complete pre-image under this map, $7! (i) = 
{a € Sy: o(a) = i} C Sz, is an independent subset of vertices of G(S,,). Hence, 
for computing the chromatic number of a graph we have to find a partition of the 
vertex set having least possible rank, under the above-stated conditions on such a 
partition. 

A graph is said to be connected if each pair of vertices is joined by a path of graph 
edges. 

A tree is a connected graph without cycles. 

A wood is a graph in which each connectedness component is a tree. 

A hypergraph on a vertex set S is a subset G of P(S). The elements of G are 
subsets e C S, and are called hyperedges. Thus, each hypergraph G is a set {e:}i<icm 
of m = |G| hyperedges e; C S on the vertices in S. An l-graph is a set G' c C"(S), 
so that an ordinary graph is a 2-graph; its elements are simply edges. It is sometimes 
convenient not to give the vertex set; e.g., a hypergraph F can be specified by its 
hyperedges {Aj,..., Am}, whose order is, in general, immaterial. 

For XS = @ the notation C*(X)-C'(S) denotes the (k +/)-graph on the vertex 
set X + S, of the form {e Cc X + S: |eN X| =k, |eN S| =|}. 

A system of sets is a hyperedge of a hypergraph, with the distinction that in a 
hypergraph all hyperedges are distinct, while the elements of a system of sets may 
repeat. Thus, a system of sets is a multihypergraph. 

By the term sufficient construct we understand either a graph, a hypergraph, or 
a system of sets, satisfying prescribed conditions; a sufficient construct is said to be 
an extremal construct if it is possible in the limit for certain parameters or structural 
characteristics. For example, the complete n-vertex graph is an extremal construct 
with largest possible number of edges among all n-vertex graphs. 


1. The theorems of Mantel, Turan, and Sperner 


Historically the first extremal result on graphs is the following theorem. 


MANTEL’S THEOREM. The largest possible number of edges in an n-vertex graph 
without triangles is equal to [n? /4]. 


The extremal construct is unique, and is the complete bipartite graph with (if 
possible) equal parts, i.e. with [n/2] and |n/2[ = n — [n/2} vertices in the first and 
second part, respectively. In turn, an immediate verification shows that [n/2]-]n/2| = 
[n? /4]. 

It is obvious that, according to the box principle, for any three vertices of this 
graph there is pair of vertices belonging to one part, i.e. a pair not joined by an 
edge. Hence there is no triangle K3 on these vertices. 

The following is a generalization of this. 
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TURAN’S THEOREM. Let T(n,k, 2) be the smallest possible number of edges in an 
n-verter graph such that for any k vertices there is at least one edge. Then 


s? (el 
Tak y= >" ( Bod ). 
io \ 2 

The extremal construct is unique, and is a system of k— 1 complete graphs with (if 
possible) equal number of vertices (with [(n +7)/(k — 1)] vertices in the ith complete 
graph, z= 0,...,/ —2). 

According to the box principle, among any k vertices there is a pair of vertices 
belonging to one of these complete subgraphs, and hence a joining edge. It is obvious 
in what way these two theorems are related. If a graph G(S,,) has the property 
expressed in Mantel’s theorem, i.e. K3 ¢ G(S,), then this is clearly equivalent to 
the fact that in the complemented graph G = C?(S,,) \ G(S,) there is between any 
three vertices at least one edge. Hence, T'(n, 3,2) = (3) — [n?/4]. 

The similar problem for uniform hypergraphs is open, and is called 


TURAN’S PROBLEM. Consider uniform /-graphs G!, c C'(S,) on the vertex set 
Sn, i.e. hypergraphs with as edges l-element subsets of the vertex set S,. Let 
T(n,k,l) be the smallest possible number of l-edges in the n-vertex graph GC 
C'(S,) such that 

VSk ear AS) € a : hep | Si 
The problem is to calculate T'(n, k,l) for all admissible parameters n > k >1 > 1. 


Besides Turan’s theorem, only some particular solutions of this general problem 
are known, namely: 
—k)l 
RD sree Se 
" . 
n—l\? 


T(n,n—1,)) =| 


nin—-1l 
rman Stl 
a aa 
Mon) — 3k, 4 <8 < 4, Leven 
ee TGA. ee Sle 
3n — | 1 E mi S&S ea ! odd; 
Boe SS ees 
3n — 4k B3<n <9 
T yee 1,3 = , 2-—-k-*™ 
(n ) 4n — 6k, 2<2<2, nf¥ M1 
4n — 6k + 2, n = Seta, d= 1,2. 


There is a very attractive conjecture of Turan, stating that 7'(2n,5,3) = 71 
where this value is realized by the 3-graph K3 + K3 consisting of two complete 3- 
graphs on n vertices each. However, even this particular problem is still open up till 
now. 

More information on Turan numbers can be found in [204] 

Altogether, the origin of extremal problems on systems of sets is the following 
result, now known as Sperner’s theorem. 
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SPERNER’S THEOREM. The largest number of subsets of an n-element set which 
do not contain each other is equal to 


(win): 


An extremal construct realizing this value is, e.g., the set of all [n/2|-element 
subsets of an n-element set, since all subsets of equal cardinality are pairwise not 
included in each other. 

We now dwell on some approaches to the solution of these and related extremal 
problems. The means most often used is that of two-sided estimation, which we will 
conveniently demonstrate on a simple concrete example. 


EXAMPLE 3.1. How many hyperedges can an n-vertex hypergraph without non- 
intersecting hyperedges have? 

If G Cc P(S,) is a sufficient construct and e € G, then, clearly, (S, \e) ¢ G, hence 
G cannot contain more than half the number of all possible hyperedges, i.e. 


IG| < Poh = = ot, 


On the other hand, for a € S, the hypergraph C(a)-P(S, \ a) has the required 
property, and has number of edges equal to 


|C(a) -P(Sn \ a)| = |C(a)| -|P(Sn \a)| = 12°71 = ae". 


At the basis of the majority of estimation means lies the use of various cardinal 
relations for systems of sets. 


LEMMA 3.1. If a hypergraph F and a system of hypergraphs W = {G,...} are 
such that VG EW: |GN F| <1, then 


>" degy(S) rae 
ser |W 


where degy(S) = |{GEew: Se G}|. 


PROOF. Recall that for a hypergraph G, vertex set A, and nonnegative integer q 
the valency is defined as the number v(A,q,G) = |{e € G: |ANe| = q}|. If G and 
F are two systems of sets, then 


»» v(A,q, G) = S- v(B,q,F). 


AEP BEG 
In fact, 
S> v(A,g,G) = S; {BeG: |ANB|=q}|= 
AEP AEG 
=) DV xIAnBlag= 5 YE x{lAnBl=q = 
AEF BEG BEG AGF 


= [{AeG: |ANBl == > o(B,¢,F). 


BEG BEG 
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It is obvious that if S C S,, then v(S,q;C'(S,)) = Gy cea therefore for 
Gc G'(S,) the above equation gives 


l\ {n-l 
u(Saict) = (1) ( Jie 
om, ) ; a) | 


If G is a hypergraph and S is a vertex set, then 
> dege(a) = dU len SI, 
aes e€G 


where degg(a) = |{e € G: a € e}| is the degree of the vertex a in the hypergraph 
G. Indeed, 


So degg(a) = So fee G: a€e}|/= >> >> xfa ce} = 


aes aes aeS e€G 
= YS x{faee}= dV lenS|. 
e€G aceS eEG 
Hence, if we now take S to be a hypergraph F’, and G a system of hypergraphs 
W ={G,...}, respectively, we obtain the identity 


> degw(S) = IGN FI, 


SEF Gew 
which implies the required inequality since VC € W: |GN F| <1. O 
In particular, this lemma allows us to prove Sperner’s theorem. The following holds. 


COROLLARY 3.1. If a hypergraph F C P(S,) has the property that VA, Be F: 
A ¢ B, then the following inequality holds: 


pe eee [A]! ne = 


PROOF. It suffices to take in Lemma 3.1 a hypergraph F' satisfying VA, B € F: 
A ¢£ B, and to take for W the set of all hypergraphs G each of which can be 
represented by a complete chain of the form G = {{So}, {Sy},..., {Sr}}; such a 
hypergraph is a chain if {So C S; C --- C S,}. It is obvious that the number of 
such chains in P(S,) is n!, hence |W| = n!. Since each hypergraph G is a chain, we 
have 


a 


YVGEW: |GNF| <1. 
Finally, it is easy to verify that degy(S) = |S|!(n—|S|)!. O 
Now Sperner’s theorem follows immediately from Corollary 3.1, since 


ej As [n/2])) 5: ae [n/2])! ay as Ae 


! 
me A€F n! A€F 
and hence 


I (iyi) 


In a number of cases it is possible to obtain an exact solution, without turning to 
two-sided estimation. This has been realized for monotone properties of graphs and 
hypergraphs. A property A is said to be monotone (hereditary) if the fact that a 
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graph G has this property (written as G € A) implies that every subgraph F’ of it 
also has this property, i.e. A is monotone if and only ifGeA>VFCG: FEA. 
It turns out that for certain monotone properties it suffices to compute an exact 
solution for one concrete value of n, after which we verify the rate of growth of the 
extremal bound looked for as a function of n: if this function satisfies well-known 
analytical conditions, then it is also the required extremal bound. The basis for this 
method is given by the following lemma. | 


LEMMA 3.2. Suppose we can compute, for an arbitrary monotone property A, the 
quantity 
l; A) = ‘Gal 
F(t; A) = max |G, 


If there exists an integral-valued function f(n) such that 


f(no) = f(no,!; A), no = l, (3.1) 
0< f(n) < f(n,l; A), n> No, (3.2) 
ae Fi =| . n> No, (3.3) 


then for alln > no we have 


f(n,l; A) = f(n). 
PROOF. Since A is monotone, we have 
GS,) € A = VSn_-1 C Sn: GS-1) = G(Sn) NC(Sp1) € A, 
and hence VSp—1 C Sn: |G'(Sp—1)| < f(n—1,1, A). Since for any l-graph G'(S,,) we 


have 

dD |@(Sn-1)| = (n — 1) - |G(Sn)I, 

Sn—1CSn 
we find that if G'(S,,) is an extremal construct, then the inequality f(n,l;A) < 
Inf(n — 1,l; A)/(n — 1)| holds. We now prove the required assertion by induction 
with respect to n > no. For n = 1 it is true by (3.1). Suppose it is true up to n— 1, 
inclusive. We show that it is also true for n. By the induction hypothesis we have 
n- f(n—1,1; A) n-f(n-1 
fin) < flmt,A) < [PATA eT) < Fe, 
n—l n—l 

Here, the first inequality follows from (3.2), the second has been proved above, the 
equality is the result of the induction hypothesis, and the last inequality follows 
from (3.3). O 


We demonstrate the application of this lemma on some concrete examples. The 
function f(n) = [n?/4] satisfies (3.3) for all n > 2. Hence, if the property A is 
that the graph does not contain triangles, then the complete bipartite graph with 
(if possible) equal parts is in fact the extremal construct in Mantel’s theorem, since 
conditions (3.1), (3.2) clearly hold for no = 2. Thus, Mantel’s theorem follows 
immediately from Lemma 3.2. 
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The verification of the threshold value no in the lemma is of utmost importance. 
To convince ourselves of this, we consider another monotone property. Let A be the 
property that the graph does not contain pentagons, i.e. 5-vertex cycles. Put 


f(n; Cs) = qnax IG rl; 


we show that 
) if2<n<4, 
f(n; Cs) = <7 iba =D, 
[|] ifn >6. 
For n < 5 the extremal value can easily be verified directly, and hence, although 
the function f(n) = |[n?/4] satisfies (3.3) for all n > 2, we find that the threshold 
value for this A is no = 6. For n = 5 an extremal construct is the graph K, 


completed by one more edge, incident with the fifth vertex; for n > 6 an extremal 
construct is the same complete bipartite graph as in Mantel’s theorem. 


2. Forbidden subgraphs and local properties 


Mantel’s result may serve as a model example for a large class of extremal prob- 
lems, whose general form is as follows. 

For a fixed graph F' we compute the largest possible number of edges in an n- 
vertex graph not containing F’ as a subgraph. 

It is, however, convenient to consider a somewhat more general statement, which 
we call the problem on forbidden subgraphs. 


PROBLEM ON FORBIDDEN SUBGRAPHS. Let L = {G,...} bea list of fixed ‘for- 
bidden’ subgraphs. It is required to compute the quantity f(n;L) equal to the 
largest possible number of edges of an n-vertex graph not containing as a subgraph 
any graph on the list L. 


In the light of this formulation, Turan’s result takes the form 


f(n, Ke) = @ — T(n,k, 2). 


Well-known formulas for certain lists of forbidden subgraphs are given in the 
Problem Section. 

The following theorem gives an asymptotic solution of the problem on forbidden 
subgraphs. 


ERDOS--SHIMONOVICH THEOREM. 


1 os 2 
flu B) = (1+ a) 4 olny 


It must be noted that the main characteristic determining the coefficient at the 
principal term of the asymptotic expansion in this Theorem is the chromatic number, 
i.e. a graph characteristic which is difficult to compute. 

The Erdés-Shimonovich theorem makes clear that the coefficient at the principal 
term of the asymptotic expansion is distinct from zero if and only if the list of 
forbidden subgraphs does not contain two-colorable ones, i.e. graphs colorable with 
two colors. The case when two-colorable forbidden subgraphs are on this list is 
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termed ‘degenerate’, and in this case the problem of computing f(n; L) becomes 
that of estimating the remainder o(n?). The region of ‘most degenerated’ cases has 
been studied. 

The equality f(n;L) = O(n) holds if and only if the list of forbidden subgraphs 
contains either a tree, or a wood (for a finite L). 

The equality f(n; L) = O(1) holds if and only if the list of forbidden subgraphs 
contains pairings and stars. 

As an example we show that 


f(n; Ks) < 0.5-(s — 1)/"n?-/" + O(n), r<s, no. 


If d,...,dy are the vertex degrees of a graph G, then the number of stars K, in 


G equals 
n d; 
= (7) 


If G does not contain the complete bipartite graph K,,, as a subgraph, then for each 
r vertices there can be at most s — 1 adjacent, hence the total number of stars K;,.; 


cannot exceed (s — 1) @); and the inequality 


z (7) s¢-n() 


implies the required inequality: 
2|G| = Dodi < (s—1)'/"n?-" + O(n). 
i=l 


For other ‘degenerate’ cases see the Problem Section of this Chapter. 
The same result of Mantel may serve as a model example of another class of 
extremal problems, whose general form is as follows. 


PROBLEM ON LOCAL PROPERTIES. For a fixed k-vertex graph Hx, compute 
m(n; H,), i.e. the smallest possible number of edges in an n-vertex graph G,, each 
k-vertex subgraph of which contains a subgraph isomorphic to Hx. 


The class of problems on local properties is a subclass of the class of problems on 
forbidden subgraphs; this inclusion gives the obvious equation 


m(n; He) = é )- f(n; {Ge : Ge Z He})- 


Let p(n; Fi.) be the maximal number of edges in an n-vertex graph such that any 
k-vertex subgraph is imbedded in the k-vertex graph F;,. Clearly, 


m(n; He) + p(n; He) = (") . 


therefore in the sequel we assume that Fy, = Hx, and we will choose for our conve- 
nience one or the other notation. 
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3. Exact solutions for local properties of graphs 


In this Section we give the solution of certain concrete extremal problems on local 
properties of graphs. We introduce some helpful notation. Let A(G) be the maximal 
degree of the graph G. Let Z(a) be the set of vertices adjacent to a. Let t(G) be 
the largest. number of independent edges in G. 


THEOREM 3.1. Let Hy be an arbitrary fixed k-vertex graph having vertex degree 
k —1, and let Fy, = Ky. Let Fy be the pairing on k vertices. Then a graph Gy has 
the property that each of its induced k-subgraphs contains Hx, i.e. 


VSu C Sa: GaSe) = C?(S:) Ga D Hp, 


if and only if Gn D Kn — Fy for Fy DB Fe and Gn D Kn — Fe or Gn D Kn — Fr for 
ry rE: 


PROOF. The sufficiency of this assertion is obvious. We prove its necessity. Sup- 
pose the graph G, has the indicated property. We introduce in the graph comple- 
mentary to it the system F, of t independent edges. 

The inequality ¢ > t(F;) can only be fulfilled if Fy, > Fy, but in this case we must 
have G, = Fa C Fn, since by adding an edge e to Fx (e must be adjacent to an 
edge of Fa, since the latter is maximal) we obtain the graph Fy + {e}, in which 
there are always k vertices S; such that the proper subgraph (Fa; + {e})(S;) induced 
by it does not have isolated vertices, the prescribed graph F, by the condition 


Hy D Ky x-1. 


Let now t < t(F,). In this case the number of nonisolated vertices of the graph 
complementary to G, does not exceed k — 1. In fact, choosing in the opposite case 
for S, © Sp, any set of k nonisolated vertices including the 2é-vertex graph Fo, by 
the maximality of Fa, we find that there is no proper subgraph G,(S;) on these k 
isolated vertices, contradicting the conditions of the theorem. It is obvious that a 
graph on at most k — 1 nonisolated vertices is a subgraph of Fy. O 


As a consequence we obtain the solution of a problem on local properties of graphs. 


COROLLARY 3.2. Suppose the k-vertex graph Fy has an isolated verter. Then 


[Fl if Fe D Fr, 


ee ae \Fel) if Fe > Fe, 


Thus, if in a problem on local properties the graph H; has a vertex of degree 
k; — 1, then the exact solution of this problem is given by Corollary 3.2. 


THEOREM 3.2. m(n; Cy) = |n(n — k + 2)/2|. 


PROOF. It is obvious that for k = 3 the complete graph is extremal, while for 
k, = 4 so is the complete graph without pairings, hence we assume in the sequel that 
k> 65. 

If G, is a sufficient construct, then each of its vertex degrees is at least n — k + 2. 
In fact, if there is a vertex a € S, with degree d(a) < n—k+ 1, then there is a 
vertex subset S, C Sp containing this vertex and such that dgys,)(a) < 1 (here, 
dgs,)(a) is the vertex degree of a in G(S,)). This means that G(S,) D Cy. Since 
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the sum of the vertex degrees equals twice the number of its edges, we find that 
Gal > |n(n — k + 2)/2[. 
For the description of extremal constructs on the vertex set S,, we introduce the 
distance d by the formula 
d(a;,a;) = min(|¢ — j],n — |¢— J). 
The following inequality holds for this distance: If 1 < 4) < +++ < ix <n, then 


d(a;,,@:,) < min (= CGE G35) e > ala.) ; (3.4) 


j=1 J 


The degree cycle Ct on n vertices is defined as the graph Cf, = {(aj,a;): 1 < 
d(a;,a;) < t}, so that C} = C,. Consequently, in Cf, every vertex has degree 


min(2t,n — 1), and 
(Ce) = min (nt, )) 


We show that C® is a sufficient construct for t = [(n — k + 2)/2]. — = 
{agj.0»;0),} G Spy where 1] < 4 <>* <i, One lf dae, ai.) StU = lye kD) 
and d(aj,,ai,) < t, then 


Ch (Sk) ag) Ck = CaHacthis Vis os) (as, 1, i, ), (a:,, i, )}. 


We now consider the opposite situation. Suppose, for the sake of being specific, 
that d(ai,;,a:,,,) 2>¢+ 1. Then, for 1 < ti; < tj41 < t, 


n—k+2 


d(ai;; Diss) < | 2 


ce ) (3.5) 


In fact, by (3.4), 
d(ai, ’ Gi, ) < d(ai, fe) maa Soe d(ai;_, ’ di; ) 4 d(a;; ’ Qi; 1) = d(ai,,,, Wrciea) = 
ae 390) RS) de Bigg) 
whencet +1 <n—k+1+1 — d(aj,,ai,,,). Since the righthand side of (3.5) does 
not exceed ¢ for 1 = 1, we always find 


Ct (Sx) id {(ai,, ip), Oo a (Pemen Tee ya 


Thus, if n — k is even, then for k even, 


Ct (Sr) 22) Cr => {(a:, ’ ig); (G55 0%) caty (Ope pct 4.)5 
Rdg sella) (ae, Er 885 (Qi, ’ Gi) (@ia5 ai,)}, 


while for k odd, 
Cr, (Sk) > Ck = { (ai, ) Gia) (Gigs Gis) ioe) (Oiiccsy Cie), 


Cae sO ls (Bens ’ Gis); sy (ai, ’ Qin), (ai, ’ ai,)}. 
So, if n — k is even, the degree cycle Ct, is extremal. 
Let now n — k be odd. If d(ai,,ai,) > +2, then £+2<n—k+1—d(ai,,ai,,,), 
hence d(ai;,ai,,,) < |(n—k + 2)/2[ + 1 — 3, and for | = 2 this quantity does not 
exceed t. Hence, also in this case C4(S;,) D C,. Thus, it remains to consider the 
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BA +t -7 
FIGURE 3.1 
i 
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1 k+t-s 
FIGURE 3.2 


case d(a;,,a;,) = t+1. Here, if d(ai,;,ai,,,) <¢t (7 = 1,...,k-2), then Ch(S,) D Ci, 
so we may assume that d(ai,,ai,) = ¢ + 1 = d(ai,,ai,,,). But then 
A( ai, , Giz) Sine wn ee d(ai;_,, @i;) ial Gg, ogo) ae ease A( i, ,, ix) = 1. 


Therefore, to shorten notation we assume in the sequel that 


Sk = {Grae lg hig Osha py Ata pi 1 


where 7 +1 <1 <j +t, 1< 7 < [(k —1)/2]. Now we consider several cases. 
Assume that n is even and k is odd. We complete C% by the pairing 


F = {(ai,a;) : d(ai,,ai;) = n/2}, 


and denote the resulting graph by G. We verify its sufficiency. 

Let t > 1,97 < (Kk —1)/2. It is obvious that (a1, @n/241) € G(Sx), and since 
j <(k—-1)/2, wehavet +7 +1<n/2+1<k+t—1. Hence G(S;) > Cy, where 
C, consists of the path 


{(An/2415 ay), (a1, a2), fey (a;, ai), (Qi, Q¢4541), (A453 41) 4542); a) (Qn/2-1, Anj2)} 
and the path spanned on the vertices 
{Qn/2, An/241>--- ,An+t-1} 


and beginning in a@y/2 and ending in ap/241. (The existence of such a path follows 
from the condition ¢t > 1, cf. Figure 3.1.) 

Let ¢ > 1, j = (k—1)/2 and j odd. Then n/2+1 = t+ 79+ 1, and hence 
G(Sz) D Cy, with Cy as in Figure 3.2. 

Let t > 1, 7 = (kK—1)/2 andj even. Then either (a;_1,a;) € G(S,) or (ai, a; 44-2) € 
G(S;). For the sake of begin specific we assume that the first condition holds. But 
then G(S;) D Ck, with Cy as in Figure 3.3. 
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Let t= 1. Then n= k + 1, and the graph 


Cat {(@1,@n/241); (a2, An), (@3,An—1),--+; (An/2,4n/2+2) } 


is extremal. 
In fact, if we remove the vertex a,, then 


Ch-1 anal {(a2,a3), (a3, @4), cee Gn cian): (An, Q2)}, 


and if we remove a vertex a;,j # 1,n/2+1, then C,_, has the shape as in Figure 3.4. 


We assume that n is odd and k is even. We complete Ct, by the pairing with a 
‘fork’ 


F' = {(41, @(n41y/2), (@i,0;) : J-t = (n+ 1)/2,7=1,...,(n —1)/2}, 


and denote the resulting graph by G’. We analyze its sufficiency. 

We first note that G’ does not possess the symmetry of choice of S; that G has, 
hence the mastering of the various S; in G’ had better be replaced by the mastering 
of two distinct pairings with a ‘fork’: the initial one and the following: 


F' = {(ain-1)/2; On), (41,43): 7-4 = (N—1)/2, i= 1,...,(n — 19/2}. 


The equivalence of these masterings is obvious, and their presence makes clear 
the larger amount of variants than in the case of G. We denote the first variant by 
a), and the second by b). 

a) t > 1. Since always 7 < [(k — 1)/2] = (k — 2)/2 < k, we havet+j+1< 
(n + 3)/2 <k+t-—1, and hence G’(S;) D Cy, with Cy as in the case of G 
for even n, odd k, andt > 1,7 < (k — 1)/2. 
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FIGURE 3.6 
b) ¢> 1,7 < (Kk — 2)/2. Since 7 < (k — 2)/2, we have t + 7 +1< (n+ 1)/2 < 
k +¢#— 1, and hence G’(S;) D> Cy, as in the previous case. 
b) ¢> 1,7 = (kK —2)/2, 7 odd. Here G’(S;%) D Cy, with Cy as in Figure 3.5. 
b) t > 1, 7 =: (k — 2)/2, 7 even. Here either (a;-1,a,) € G’ or (az, at4542) € G". 
But then G’(S;) D Cy, with Cy as in Figure 3.6. 
Finally, let ¢ =: 1. Then n = k + 1, and the extremal graph is 


Cr + {(1, @(n43)/2)1 (G1, @(n41)/2), (@2, An), (@3, @n—1), -»» » (@(n—1)/2s @(nt5)/2) }- 
In fact, if we delete the vertex a), then 
Cn—1 = {(@2, a3), (@3,44),.-.-, (@n—1, An, A2)}, 
and if we delete a vertex a;, 7 # 1,(n + 3)/2,(n + 1)/2, then C,_, has the shape as 


in Figure 3.7a), while if a vertex a;, 7 = 1,(n+3)/2, (n+ 1)/2, is deleted, then C,,_; 
has the shape as in Figure 3.7b). O 


We note another consequence, concerning the structure of graphs, of the results 
obtained. We call an n-vertex graph locally Hamiltonian if for a natural number k 
(3 <k <n) every k-vertex proper subgraph of it is Hamiltonian, i.e. is a Hamiltonian 
cycle Cy. As usual, a Hamiltonian path is a simple path passing through all vertices 
of the graph only once. 


COROLLARY 3.3. In a locally Hamiltonian graph there issues from each vertex a 
Hamiltonian path. 


PROOF. Assume the contrary: the locally Hamiltonian graph G, has a vertex 
a, from which no Hamiltonian path issues. Let FP; = {(@1,@2),...,(@i-1,@1)} be 


114 3. EXTREMAL PROBLEMS ON GRAPHS AND SYSTEMS OF SETS 


FIGURE 3.7. ab 


the longest path issuing from a;. It is obvious that k <1! <n-—1. Put S, = 
{az,-.-,@1,---,@tye-1}, Wheren—k+1>t>l—k+1,1 >t > 2 (this can always 
be realized). Since G, is locally Hamiltonian, G,,(S;) D Cy, hence in G,(S,) a 
Hamiltonian path issues from each vertex, and thus also from a;. Consider such a 
path issuing from a,. In G, we then have, setting 


P = {(@1,@2),..., (G1, At) } C Pi, 


a path P, starting at a; and not passing through {a),...,@:-,} (by construction). 
Hence there is a path P, U P; in Gy, issuing from a, through t + k — 1 > 1 vertices, 
contradicting the maximality of Rh. O 


It is remarkable that the formula of Corollary 3.2 remains asymptotically the same, 
even if the volume of the locally Hamiltonian graph is divided by 2. This is expressed 
in the following corollary. 


COROLLARY 3.4. If m(n; F,,) denotes the smallest possible number of edges in an 
n-verter graph such that among any k vertices there 1s a pairing with a ‘fork’, then 


m(n; Fx) = |p = . 


2 


Considering the result of this corollary, P. Erdds became interested in a condition 
for the presence of systems of certain independent subgraphs in a local graph, where 
the systems are taken to be less trivial than a pairing, and in the first instance 
consist of two independent triangles. The following theorems allow us to find such 
conditions. 
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THEOREM 3.3. Forn > 2p +4, 


(Se eg SO, (3.6) 
3] ¢ pa leg, (3.7) 
Roa le eS ogee. (3.8) 
max(p*,n), p> 2,q=0, (3.9) 
Ln; Kpp+q) = 3 [3 ’ P= 2, = I n= 5,7,9, 
(3.10) 
[2], p=2q=1,nF5,7,9, 
(3.11) 
pp. pS sg, = 2p 1, 
(3.12) 
max ((p + 1)?;3 [3|). p>3,qg=1,|n> 2p4+2, 
(3.13) 
p+pq, p>2,q>2. (3.14) 


PROOF. Let f(A,t) be the maximal number of edges in the graph with maximal 
degree at most A and number of independent edges at most t. Then the estimate 
f(A, yt) < t(A + 1) holds, and the following exact formula is true: 


At if A > 2t + 1, 
joan = . 2 
At+ [3] [tat] fA < 2 


We show the formulas (3.6)—(3.14), and give the corresponding equation number 
before proving it. 

(3.6). Trivial. 

(3.7). The problem is equivalent to computing the maximum number of edges in 
an n-vertex graph without triangles, and thus equivalent to Mantel’s theorem, since 
the only extremal construct is the complete bipartite graph with (if possible) equal 
parts. 

(3.8). The sufficient construct G, cannot contain two independent edges, i.e. 
IG,| < n—1, and this value is realized by the star K,,,-1, which is again the unique 
extremal construct. 

(3.9). For p > 1 the maximal degree of Kp pig is p + q < 2p +q-— 1. Hence 
A(G,) < A(Kp pig) = p +49, and hence |G,| < [n(p + q)/2]. Let G, be an arbitrary 
sufficient construct and a € S, a vertex of degree A = A(G,). Since K,, Z Ks, 
we have G, D K3, hence Gr({a} + Z(a)) = Kya. Put S = S, \ {a} \ Z(a), and 
consider the induced (n — A — 1)-vertex subgraph G(S) C Gy. If A > 3, then 
t(G(S)) < p—A. In fact, since Kp» D Ki, + Fap—ayiy, we have G(S) DB Fap—a+1) 
for 2p > 1 + A+ 2p — 2A + 2, or A > 3. Thus, in this case, 


IGn| < A? + |G(S)| < A’ + f(A,p— A) < A? + (p— A)(A+1) = 
=pA+p—-A<p’. 
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If A < 2, then clearly |G,| <n. The extremal constructs are K,, and the Hamil- 
tonian cycle Ch. 

(3.14). Since t(Kppig) = p < [(2p + q)/2], we have t(G,) < p. Thus, |G,| < 
f(p+a,p). Ifp+q2> 2pt+1,org>pt1, then f(p +q,p) = p(p + q), and then 
(3.14) has been proved, since Kp5+q is a sufficient construct. 

We note certain properties of a sufficient construct G,. First we note that x(Gn) = 
2. In fact, Gy, does not have ‘short’ (< 2p + qg) odd cycles which are not imbedded 
in Kppiq as graphs. At the same time, the presence of a ‘long’ (2p + q) cycle 
would imply the presence of more than p independent edges. Hence G, can only 
contain the cycles C4,..., Cap. If A(Gn) < p+q-—1, then |G,| < f(p+q—-l1,p) < 
pip +q—1+4+1) = p(p +q), so that we assume that A(G,) = p(p + q). Let l be the 
largest integer (1 <1 < p) for which Ki,54g C Gn, where the /-part of Ki,p4q consists 
of the vertex set S; and its (p + q)-part of S,,,. Put S,_; = S, \ S;, and consider 
the subgraph induced by these vertices: G(S,_;) C Gy. Since t(Gn) < p, we have 
t(Gr_1) < p—l. If A(G(S,_1)) < p+ q-—1, then 


IG(Sp_-)| < f(p+qa—-1,p-—) <(p-Yiptq-1+)=(p-Y(pt+q, 


and hence 


IGn| = U(p + 9) + |G(Sr-i)| < Mp + 9) + (p —D(p + @) = pp + 9). 
Therefore we assume that A(G(S,_))) = p + q and that a € S,_; is a vertex having 
this degree in G(S,_;). Since A(G(S,)) = p + q, we have a ¢ Syyq. 

Consider the set Z(a). Now Z(a@) = Spyq contradicts the maximality of /, so that 
Z(a)M (Spi \ Sp4q) # 9. Consider an edge (a,b) € G(S,_1) with b € Z(a)N (Sp_v \ 
Spiq) If now 1 <1 < p—2, then Kppig D Kipig + (4,9), i.e. in this case also there 
cannot be a vertex of degree p + g in G(Sp_1). We consider the remaining cases 
l= p,p—1. Since |G(S,_1)| < f(p +4,p—l), we find (since p + q > 2(p—1) + 1 for 
l= p,p— 1) that in these cases the equality f(p + q,p —l) = (p—!)(p + q) holds, 
and hence 


IGn| = |Kiptal + |G(Sn-1)| < (p+ a) + (p—D(p +9) = vip + q). 


The extremal construct in case (3.14) is Kpp+q. 
(3.12). This case has been realized; the extremal construct is Kp p+4q- 
In the cases (3.10)—(3.12) a general estimate holds: If p > 2, g = 1, then 


H(1; Kppi1) S max ((p + 1)?, [F]). (3.15) 


In fact, with the notation of the proof of (3.9), if A > 4, then t(G(S)) < p—A+1, 
since Kppi1 D Kia + Fap—a42) (which is true if 2p + 1 > 1+ A+ 2p + 4— 2A, or 
A > 4), and hence G(S) D Fap_a+2). Consequently, 


IGn| < A? + |G(S)| < A? + f(A,p- A+ 1) =A? 4+ (p—At1)\(At]) = 
=pA tpt l< (pt 1). 


If A < 3, then clearly |G,| < [8n/2]. Thus, (3.15) has been proved. 
For n > 6, (3.15) takes in the cases (3.10)-(3.11) the form 


un; Kppyi) S [22] ; 
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FIGURE 3.8. TJ, 


FIGURE 3.9. Aj, 


(3.10). Let Gn be the extremal construct. The case n = 5 is trivial: Gs = Ko3. 
For n = 7, if G7 D C7, then |G7| = 7; if x(G,) = 2, then |G,| < 3[n/2], and this 
bound is realized by K33. Let n = 9 and consider the shortest odd cycle contained 
in Gg. If it is Co, then |Go} = 9; if it is C7, then for realizing (3.15), the remaining 
two vertices must be incident to six edges, which is impossible. Hence, here also 
x(Go) = 2, ie. |Go| < 3[9/2] = 12, and this value is realized by the graph Tg. 

(3.11). If n is odd, then (3.15) is realized by 7, (see Figure 3). If n = 11, then 
(3.15) is realized by the graph Aj. Ifn =1 (mod 4), n > 13, then (3.15) is realized 
by A,. If n = 3 (mod 4), n > 15, then (3.15) is realized by Ag. (See Figures 3- 
3.) The sufficiency of these constructs can be verified immediately, and is based on 
the fact that for p = 2, q = 1 a graph G,, is a sufficient construct if and only if 
A(Gr) < 3, Gn BD C3,Cs. 

(3.13). Silvester’s theorem, given in Chapter 2, substantiates the existence of the 
following graph, for nonnegative integers a and 6, 


An = Kpiipti 1 3 |B] < (p+ 1)?, 
aT +7, if 3[2] > (p + 1)?, 3a + 40 = [4], 


a 
2 
Te 
2 


where 7g = K33 and 7g is isomorphic to the edge graph of the ordinary three-cube. 


FIGURE 3.10. A; 
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FIGURE 3.11. A3 


We show that A, is a maximal construct. Since |A,| = max((p + 1)?, 3/n/2]), for 
n even it coincides with (3.15), so that (3.13) is proved for n even. Let n be odd. We 
show that p(n; Kp 11) < max((p + 1)?, 3[n/2]). For this we consider an arbitrary 
extremal graph G, with A(G,) < 3. It contains odd cycles, and exactly [38n/2] edges 
(if one of these conditions is not satisfied, the required estimate follows at once). In 
turn, the condition |G,| = |3n/2] implies the existence in G,, of exactly n—1 vertices 
of degree 3 and one vertex of degree 2. Consider the odd cycle C, C Gy. Clearly, 
r >2p+ 1. Consider a path Pop41 C C, not passing through the degree-2 vertex of 
G,, (this can always be done, even if this vertex belongs to C,, since r > 2p + 1). 

For the sake of being specific, let 


Pop41 = {(a1, 61), (b1, a2), (@2, b2),.. +, (Bp, p41) 


and S, = {b,...,b,} and S,;1 = {@1,-..,@p41}. Consider all edges of G, incident 
with vertices b; and not belonging to the path P2,;,. Since the cycle C, is maximal, 
it does not contain chords, hence the number of such edges is p. We denote the set 
of ends of these edges by V = {v,v2,...}, and the graph formed by it and the path 
Pop+1 by H. If V contains a vertex v such that (v,}:),(v,6;) € H with 7 —7> 1, 
then the subgraph of H induced by the vertex set Sp41 + (Sp \ b:) + {uv}, where 
i <1 < J, is connected and, being colorable by two colors, has p — 1 vertices of one 
color and p + 2 vertices of the other color, i.e. cannot be included in K,,;1- In the 
opposite case all vertices of H have degree at most 2, and if (v, b;), (v,b;) € H, then 
|i - 7| = 1, i.e. vertices of degree 2 are incident by ‘neighboring’ };. 

Here we consider two cases. First, let p > 4 and let, for the sake of being specific, 
(v1, b1), (v2, bp_1) € A. It is clear that then v; # v2 (since the subgraph of H induced 
by the vertex set (S,+1 \ {@pi1}) + (Sp \ {bp }) + {ur} + {ve} is a (2p + 1)-vertex tree); 
it cannot be included in K,,41, since it has p — 1 vertices of one color and p + 2 
vertices of the other color. Finally, let p = 3. If |V| = 3, we can act as in the case 
p= 4. If |V| = 2, then the similar tree not imbeddable in K3,4 is induced by either 
the vertex set {v1, v2, 61, b2, a1, @2,a@3} or the vertex set {v1, V2, be, b3,a4,@3,a2}. O 


We! say that a graph G is an H-graph if any subgraph generated by |V(H)| 


vertices contains H as a skeleton subgraph. Let m(n,H) be the smallest possible 
number of edges in an n-vertex H-graph. In Theorem 3.2 we have shown that for a 


! The following material (on local properties of hypergraphs) was written in collaboration with A.V. Kostochka. 
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graph H which is a k-cycle, for any n > k we have 


m(n, H) = PA). (3.16) 

In this case the lower bound is trivial, since the minimal degree 6(G’) of any 
k-vertex subgraph G’ in a H-graph cannot be less than two. Of course, this very 
lower bound holds for m(n, H) for any k-vertex H with minimal degree equal two. 
We show that (3.16) is true for n > 8k? for any 2-homogeneous k-vertex graph H, 
if there is at least one cycle of length larger than three in it. Moreover, we show 
that an n-vertex graph Gr with [n(n — k + 2)/2] edges is an H-graph for every 
2-homogeneous k-vertex graph, except if this k-vertex graph consists of 3-cycles only. 

Below we set Nc(v) = {w € V(G): (v,w) € E(G)}, degg(v) = |Ne(v)]. 

Since for even n the construction is very simple, we regard this case separately. 


LEMMA 3.3. Let G = (X,Y; E) be a bipartite graph without 4-cycles, |X| > 2, 
[Y| > 3, |X UY|> 7. Then 


dre X: degg(x) < |Y| -— 2 
and, moreover, either a) 


drj,%2 EX, yiyoE VY: 21 F Xa, yr F Yo, (L1, 1), (L2,y2) ¢ E(G), (3.17) 
or b) |X| = 2 and some verter in X is adjacent to all vertices in Y. 


PROOF. Let X = {x,...,%s}, Y = {y1,---, ye}, dege(a1) < degg(x2) < +--+ < 
degg (Zs). 

Case 1. t > 4. If dege(a1) > |Y| — 1, then |Ne(a1) MN Ne(x2)| > t — 2 > 2, 
and G contains a 4-cycle passing through x; and x2. Hence a) holds and there are 
yo. y1 © Y \ Ne(x1). 

Assume that b) does not hold. if there is a y € Y \ Ne(x2), then we may assume 
that y # y and, taking yo = y, we obtain a tuple 21,22, y1, y2 satisfying (3.17). 
Hence Ng(x2) = Y. Since G does not contain 4-cycles, we thus find s = 2. 

Case 2. t = 3. Since |X UY| > 7, we have s > 4. Since b) is symmetrical with 
respect to x and y, it has already been dealt with in Case 1. 

Assume that a) is not true. Then 


degc(x;) = 2, Via S: 


But for a set of three elements there are only three distinct pairs of vertices. Hence, 
because s > 4, we find x1 # x2 with Ne(x1)N Ne(x2)| > 2, i.e. G contains a 4-cycle 
passing through x7; and zz. O 


THEOREM 3.4. Let H be a 2-homogeneous graph on k (k > 7) vertices, in which 
the length of a maximal cycle is larger than 3. Then for n > 8k? the smallest number 
of edges in an n-vertex H-graph is equal to [n(n — k + 2)/2]. 


PRooF. As noted above, it suffices to construct for each n > 8k? a graph G = 
Gre, With n vertices and [n(n — k + 2)/2] edges, in which any subgraph generated 
by k vertices contains all of the sets H considered as a skeleton subgraph. 

Part 1. n even. For n > 8k? there is (see, e.g., [108]) a (k — 3)-homogeneous 
bipartite n-vertex graph G) = (X,y; FE) with |X| = |Y| and without 4-cycles. Put 
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G = G, (i.e. G is the completion of G;). Clearly, G is an (n — k + 2)-homogeneous 
graph. Let V'c X UY, |W’) =k, X'=XOV', Y'’=YNOV". Since X and Y have 
equal cardinalities, we may assume that 


|x") < [YL 


Then |Y’| > 4. Let H be an arbitrary 2-homogeneous graph from the conditions of 
the theorem. If X’ = @, then G’ = G(Y’) is a complete graph and H C G’. 

Case 1. X' = {x}. Since degg(x) = n—k + 2, there are y1, yo € Y’ adjacent to 
x. Since G(Y') is a complete graph, this implies the inclusion H C G’. 

Case 2. X' = {x,,X%2}. Since x; and 22 are adjacent to at least one vertex in 
Y', by Lemma 3.3 there are y; # yo in Y’ such that (21, y1), (v2, y2) € E(G"’). By 
requirement, H contains a cycle C) of length larger than 3. It can be included into 
G", using the path (y1, 71, V2, ye) and (if |V(C1)| > 4) vertices in Y’ \ {y,,y2}. The 
remaining cycles in H can clearly be included into the remaining complete subgraph 
of G’. 

Case 3. 3 < |X'| < |Y’|. Suppose H consists of the cycles C),...,C,, |C;| = ¢; 
(i = 1,...,7). Let 7 be the index for which 


Cte +61 < [X'| <a t+ t+ 5, 


and let 
b= |X'| —c, — +++ — 65-1, a=c;,—b. 
If b = 0, then C,...,Cj_, can be included into X’ and C;,...,C;, into Y’. 

If 6b = 1, then by Lemma 3.3 we can find an x € X’ adjacent in G’ to certain 
vertices y1, y2 from Y’. Distribute C),...,C;-1 onto X’\ {x}, Cj by using the path 
(y1,X, Yo), and C;41,...,C,, onto the remaining vertices of Y’. 

If a = 1, then we proceed similarly to the case b = 1. 

If both a,b > 1, then we fix vertices 21,22 € X', y1, yo € Y' with (21, y1), (12, y2) € 
E(G") (these exist according to Lemma 3.3, and then place Ci,...,Cj-1 into X’ \ 
{1,72}, Ch41,...,C, into Y’ \ {y1,y2}, and Cj onto the remaining vertices such 
that the edges (x1, y1) and (#2, y2) are used. This proves Part 1. 

To prove Part 2 we need the following 


(yi, Yo), (y3, ys) ¢ E(G). Then for any co,...,c such that 


2, i=0, 


seh zs 
Cot:::+G=p and a2] <i<t (3.18) 


> > 


G contains a subgraph consisting of a path Py and cycles C\,...,C; such that 
i) |V(Po)| = co; 
li) |V(Ci)| =a @=1,...,t); 
iii) Po joins y with ys. 


PROOF. For p = 4 the only choice satisfying (3.18) is co = 4. 

Let p > 4, and let yi,...,y4 satisfy the conditions of the lemma. For cp = 2 we 
consider G’ = G \ {yi, ys}. If p — 2 = 3, then, since yz and y, are isolated vertices 
in G’, we have G’ = K3. If p— 2 > 3, |E(G’)| < 1, the inclusion of C},...,C;, 
into G’ is obvious. If, however, |F(G’)| > 2, then we take two arbitrary edges 
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(ys, Ye), (yz, ys) € E(G’) and apply the induction hypothesis for t/ = t — 1, ¢ = ci41 
Be Oye 5b 1) ys ye (9 dies 4h 

Let co > 2. If E(G)| > 3, then we choose in G \ {y,,...,y4} an edge (ys, y6) 
and apply the assertion of the lemma to G” = G \ {y:}, with § =c@—1,ci=«G 
(@=1,...,¢), yf = yyse Gi = 1,.-.,4). If, however, E(G) = {(y1, ye), (y — 3, y)}, 
then the assertion of the lemma is obvious. O 


Part 2. n odd. For n > 8k? there is a bipartite (k— 2)-homogeneous (n+ 1)-vertex 
graph G, = (X1,¥%1; £1) without 4-cycles. Let v be a vertex from X1, Ng, (v) = 
{y1, oa ,YR-2}, G. = (G, \ {v}) U { (Yoi—1, Yori): ] < 1 < (k = 2) /2}. Since the girth 
of G is less than 6, the girth of G2 is less than 5. Moreover, 

IY ONe,(x)| <1, Wr EX, \ {v}, (3.19) 


where Y = {y1,..-,Ye-2}. Put G= Go = (V,E), where V = XUY, X =X, \ {v}, 
Y = Y,. By construction, 


_ fn (n + 1)(k — 2) k—2!|  [n(n-—k+2 
(2-2 eas fe 


As in Part 1, let V’ CV, |V') = &, X= XNV',Y' =H YOV'. If GY) isa 
complete graph, we reason as in Part |. Thus, we may assume that 


{m,y2tCY, (yy) EE, — deger(y1) 2 dege» (ys). (3.20) 
Condition (3.19) is equivalent to the following: 
IY ANe(x)|>|¥|-1, Were X. (3.19') 


Case 1. Y' = {y1, ye}. Then |X| > 5. Since 6(G’) > 2, we can find 21,22 € X’ 
adjacent to yo. By (3.20), (3.19’) there are x3,2%4 € X’ \ {1,22} adjacent to yj. 
Since G(X’) is a complete graph, we can easily include H into G(X' U {y1, y2}), 
using the paths (xj, ye, X%2) and (x3, y1, 24). 

Case 2. Y' = {yj}, yo,z} Since 6(G’) > 2, we can find 21,72 € X’ such that 
(21,91), (%2, Y2) € E. By (3.19’) we can choose x; # x2. Thus, if H contains at least 
one cycle of length larger than 4, then H can be included into G’, using the path 
(11, Y1, 2, Y2,%2). Consequently, H consists of cycles of lengths 4 and (possibly) 3. 
If there is an x € X' adjacent to y,; and yw, then the 4-cycle (x,y), z, y2) and the 
complete graph G(X’ \ {x}) show that it is possible to include H into G’. Thus, 
using (3.19’) we have 


X'\ Nery) = X' 0 Nev (y2). (3.21) 


As noted, there is an x2 € X’ with (x2, y2) € E. Assume that |X'N Ne (y— 1)| > 3. 
Because of (3.21) and the absence of a 4-cycle in G, we have |X'N.No@ (y1) Ne (z)| > 
2. Let r, € X'NNer(y1) Nor (2), {x3, 24} C (X'NNer(y1)) \ {x1}. Then the 4-cycle 
(Xo, Yo, Z,21) and the path (x3, y1, 74) show that it is possible to include H into G’. 
Thus, |Ne(y1) A X"| < 2. Since k > 7, by (3.20) this means that k = 7, |X’| = 4, 
|X’ M Nor (y)| = |X’ M Nov (y2)| = om = C3 U ore Let Ne (yi) \ {z} = {x1, x3}, 
X’ \ {2,73} = {x2, 24} Since G does not contain 4-cycles, {71,23} N Nev (z) # 0. 
Let (x,,z) € E. Then G’ contains the cycles (x1, z,y1) and (ye, re, 13, 24). 
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Case 3. |Y'| > 4. Suppose H consists of the cycles Cj,...,C,, |V(Ci)| = G@ 
(i= 1,...,r),¢) <--- <c, and let 7 be such that 
Cte $61 <|¥'| <q +--+. 
Put b= |Y’|—c, —----cG-1,a = c,;—b. If b = 0, then by Lemma 3.4 we can include 


C),...,Cj-1 into G(Y"), and C;,...,C, can be trivially included into the complete 
graph G(X’). 

Subcase 3.1. E(G(Y')) = {(y1, y2)}, 21,22 © Y’\ {y1, yo}. First, suppose a = 1. If 
|X'| > 1, i.e. 7 <r, then |X"| > 5. Since G' does not contain 4-cycles, we can then 
find an x € X' with Ne(x)N {z1, z2} 4 O. By (3.19), Ne(x)N {y1, yo} 4 0. Assume 
that {(y1,x)(x, 21)} Cc E. Then it is easy to include C),...,C; into G(Y’U {z}), 
using the path (y,, 2, 2), and Cj4),...,C, can be included into the complete graph 
G(X' \ {x}). However, if X’ = {x}, then by (3.20), (x,y) € EF and since 6(G’) > 2 
we can finda y € Y’ \ {y— 1} with (z,y) € E. Thus, we can include C, into G’, 
using the path (y,x,y1) (recall that c, > 4), and we can include C}),...,C;—1 into 
the remaining complete subgraph G’ \ V(C4). 

Suppose now that 6 = 1. Then |X’| > c,—1 > 3 and, by (3.20), |No(y)AX"| > 2. 
Taking into account that (21,41), (%2,y2) € E, we can place C; into X’U {y;}, 
using the path (2), y1,%2), and we can place C),...,Cj_, into the complete graph 
G(Y' \ {y.}), and Cj41,-..,C, into the remaining complete graph G(X’ \ V(C;)). 

Finally, let a,b > 2, {2,22} C X". Since G does not contain 4-cycles, g contains 
an edge joining {7,22} and {z,z2}. We may assume that (x1,2,) € E and, by 
(3.19'), (x2,y1) € &. Then we can easily include C),...,Cj—, and a path of length 
b — 1 joining y, and z, into G(Y'), and Cj41,...,C, and a path of length a — 1 
joining x; and x2 into G(X’). 

Subcase 3.2. |E(G(Y’))| > 2. Let {ys,y4} CY’ \ {n, ye}, (ys, ya) ¢ E. 

For a = 1 we choose an x € X’ at random. Because of (3.19’) we may assume that 
{y1,y3} C Nc(x). Thus, by Lemma 3.4 we can include C),...,Cj into G(Y’U {z}), 
using the path (y, x, y3). 

Let a,b > 2, {x,,x22} C X’. Again because of (3.19) we may assume that 
(21,91), (@2, y3) € &. By Lemma 3.4, we can include Cj,...,Cj_,; and a path of 
length 6 — 1 joining y; and y3 into G(Y’). We can include the cycles Cj41,...,C, 
and a path of length a — 1 joining x; and 2 into the complete graph G(X’). 

Only the variant 6 = 1 remains to be investigated. As in subcase 3.1, here we 
have |X’M Ne(yi)| > 2, and by Lemma 3.4 the cycles Cj, ...,Cj—1 can be included 
into G(Y' \ {yi}), unless |Y’| = 4, 7 = 2, cq) = 3. The cycles Cj,...,C, can be 
trivially included into G(X’ U {y}). So, let Y’ = {y1, ye, ys, ys}, C1 = 3. Renumber 
the cycles such that c) = max{c;: 1 <7< r}. We obtain one of the variants already 
considered: 6 = 0,a = 1 or a,b > 2. This proves the theorem. O 


REMARK 3.1. It can be seen from the proof that Gr,x is an H-graph not only for 
2-homogeneous, but also for a whole series of other graphs H with 6(H) = 2. 

The local properties of graphs give rise to many very difficult extremal problems. 
Here we consider only one such property: the local Turan property. We say that an 
n-vertex k-graph GE c C*(S,,) has the local Turan property if 


WSp C Sn 3S_q C Sp: C¥(Sq) c GE. 
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The main problem is to compute the quantities 
m(n, p,q, k) = min|G*|, 


where the minimum is over all locally Turan n-vertex k-graphs G*. The relation 
between the Turan and local ‘Turan properties is obvious: T(n, p,q) = m(n, p,q, q). 
However, their interrelations are not exhausted by the fact that one is a particular 
case of the other. For example, the definitions imply that if a g-graph F? has the 
Turan property, then the k-graph 


GS OS) 


S,€F9 
has the local Turan property. 
THEOREM 3.5. The following formulas hold: 
1) ifp <k(q— 1)/(k— 1), then m(n, p,q, k) = ("Pt ; 
2) ifn < q(p —1)/(q—1), 2k >q +1, then m(n,p,q,k) = (n— p +1)(2). 
PrROoF. Let G* have the local Turan property, and let the set 
k(p-q) 
He >~ C%(S,) 
i=0 
be such that 
k 
VS € SCS), \) Ss, SA +S as, CS5e CUS): CC. SCS. 
i=0 


To prove the existence of such a set H we consider a sequence of subsets fF = 
{S} <i<; according to the following rule: 


SY = S, € C*(S,) \ CF, 
k , a 
sero (Sn ‘ose se) : WSp D STS VS, C So, 
vi j=1 


C*(S,) C GF = S® ¢ Sy. 
Suppose this process breaks off at level (step) 1. We put 


l 
Hey Ss? 
i=1 


and show that | < p—q. Then, by construction, |S@| < k and thus |H| < k(p—q), so 
that H clearly has the required properties. If 1 = p—q+1, then |H| < k(p—q+1) < p 
by the sufficiency of G*: 


W6e > AS, C552 CUS.) CG 
Let S, and Sg be precisely such. Then, by construction, 
VSY EF: SY ES, 
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thus, |(S, \ S,) A S| > 1, and hence (by the pairwise disjointness of sets in F: 
VS, SY EF: S9AS® =p), 


we obtain a contradiction: 
p-qtl1 


p—q=|Sp\Sel => >> |(Sp\ Sq) S| > p—@t1. 
i=] 


It is obvious that 
VSq C Sp A: |Sgn Al > |A| — (p—4). 
Hence, if 
U(Sp»;Sq,G) = |{ee G: S,Ne= Sq}, 
then by the sufficiency of G* and the definition of H we have 


ISI) (Sa HI) — (\Hl—p +a 
u(S.\ #864 > (15) (OG) > ( b~|s| ) 
Then 


l= Y uS\ES,e)> ¥ Geree 


SCSn\H SCSnr\H 


- > E 7) ee ‘ : ‘ . ‘ | 


or |G*| > (Geass! Since the construct C*(S,_p4q) is sufficient, this implies the first 
assertion of the theorem. 
We represent the minimal locally Turan k-graph G* in the form 


Gre |) CNS), 
SE Fa 


where the Turan q-graph F? has the following two properties: it is critical, ie. by 
removing a g-edge of it, it no longer has the Turan property; it has the property of 
pairwise separatedness: 


VS S8 € Ft: SONS = 6. 
Then 


|G*| > T(n, p,q) (7) =e pt »(Z). 
D k 
and if always . . 
AS) SOLE Ft: SO ns! AG, 
then 


Ja € S,: v(a,1;G*) > @! 
We show the presence of such a vertex. It is immediately clear that if 


k> (54 — 8q + 4)? —q + 2), 
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then there is such a vertex, since under these k conditions the presence of two 
intersecting g-edges S®, so € F4 implies 


. . 4 SYA SM — | 
Va € S)) Sp) : w(a, G4) 22(7 - ( aa 


IV 


Let now F% be a critical Turan k-graph, and let 
(1) OG) of!) . Oz) (z) (1) 
Von Oe Oe Ed” ay Sey ay =e. 


If also 
) oG ; i 
450), 59) € FT: (SO S| < 4, 


then for such q-edges, 


Vae So nso) : v(a,1;G*) > a 


If, however, 
WS}? Si) € F*: either |S}? 1S? > 5 or SP) NSP = 0, 
then F? is clearly not critical. The alternative 
VSO, So sO e Ft: SON SONS ZO 

implies the required immediately, provided 

VSO, SO € F?: either |S 1 S| =q—-1or SONS = 0, 
but the latter implies, by the ‘Turan property and criticality of F%, that 
Thus, for this S941, 

Va € Sq41: v(a,1,G*) > ). 


In turn, the presence of vertices of arbitrary high degree is guaranteed by induction 
with respect to the number of vertices. Clearly, m(p,p,q,k) = (2) foranyp>q>k. 
Suppose this equality holds up to n — 1 inclusive. We show that it holds also for 
n. Assume the contrary: m(n,p,q,k) < (n-—p+ (a): It is obvious that if the 
k-graph G® is locally Turan, then the k-graph obtained from it by removing all k- 
edges incident to an arbitrary vertex is an (n — 1)-vertex k-graph, and is also locally 
Turan, with the same parameters. Suppose a € S,, is a vertex such that 


v(a,1;G*) > a 
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Then, by the preceding Remark and the induction hypothesis, 


mi(n -_ Lipase) < |G*| _ v(a, 1; G*) < m(n, p,q, k) = (1) ase 


<(n— p+ n(f) - (i) = n=) (Z) 


contradicting the induction hypothesis. 0 


4. Asymptotics for local properties of graphs 


If it is known that a k-vertex graph G; is contained as a subgraph in the complete 
t-partite graph with parts of k; vertices (k,} > --- > ki, kj +--- +k = k), then, 
clearly, the following ‘chromatic’ information can be deduced: 


x(Ge) <t, — x(Ge) > ky. 


Can we extract ‘chromatic’ information concerning G, if such an inclusion as above 
is not present? It turns out we can. 
The outer chromatic number of a k-vertex graph Fj; is the number 


/ = . 
x (Fe) = min x(Ge), 


where y(G;) is the ordinary chromatic number of the k-vertex graph G;. Unlike the 
ordinary chromatic number, the outer chromatic number can be explicitly calculated. 


LEMMA 3.5. If Fy # Ky and Fy, has t connectedness components, with k; vertices 
in component i andk, >--->k >1,k+---+kh =k, then 


2 7 $1. 


/ a . 
x (Fe) = min p= 


PrRooF. If G, Z Fy and Gz C Kny..n,, where (n1,...,nr) tk, then Knyin, LZ Fr. 
Hence the equality y’(F;,) = r is equivalent to 


Dy, os te) PRS ey ae Cs 
V(n1, wea? Ne) F k : | ar oar CS Ey, 


Thus, if r is the largest integer for which 
VOhigete stig) FS Bape Js 
then y'(F,) = r + 1. In turn, 
Bastin CB > Tagg, OP 


or 
Ke hee ee 


The latter inclusion clearly holds if and only if (m),...,n,) D (ki,...,k:) H &, where 
k, is the number of vertices in the 7th connectedness component of H;. Consequently, 
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the number r looked for is the number r(kj,..., ki; &) from Corollary 2.1, according 
to which (taking into account that Ff, #4 Ky, and, thus, k; > 1): 


This proves the lemma. 


We now note a relation between the ordinary and outer chromatic numbers. The 
inequality 


/ 
x(Gx) > ymax xX (Fx) 


follows immediately from the definition of outer chromatic number. In particular, 
it allows us to obtain an explicit lower bound for the ordinary chromatic number. 
In particular, we have 


C= gat hy ~ 1 1 
X(Ge) 2 max min |—F—7— | +1, 


where the maximum is taken over the partitions (k),...,k,) | & for which the graph 
Kx... does not contain the graph G, as a subgraph. Therefore, the optimal bound 
is obtained for partitions (k,,...,k,) / & maximizing the function r(k,,...,k;;k). 
This bound is well-known: If b is the largest number of independent vertices in the 
graph Gx, then x(Gxy) > k/b. We derive it from the previous inequality. For this we 
put 


A, — Ko44 ae (k -—b- 1) Ky. 


This means that the graph H;, consists of the complete graph Ky,,; and k —b—-1 
isolated vertices. Then 


1 


= k- k 
A, — I, D Gk and x (Gx) > x’ (Fy) — -] + ] - re 


b 


Somewhat different concrete general bounds of the chromatic number in terms of 
the outer chromatic number are given in the Problem Section. Knowledge of the 
precise value of the outer chromatic number makes it possible to asymptotically 
solve problems on local properties, and to explicitly compute the coefficient in front 
of the principal asymptotic term. 


THEOREM 3.6. Suppose a nonempty k-vertex graph H;, consists of t connectedness 
components, with k; vertices in componenti and ki >--->k >1,k)+--- +k =k. 
Then 


m(n; He) = kal 
2 Miny-j4 ean 
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PROOF. This theorem follows immediately from the Erdés-Shimonovich theorem 
and Lemma 3.5: 


mn; Hy) = (") — p(n; Fe) = (") — f(n;{Ge: Ge Z Fe}) = 


n 1 n? ra n? or 
. ( - (14 ry) F tom - OF) 172 tO") = 


n? 


ate] 


2 mini: ky>1 kj—-1 


5. Elements of Ramsey theory 


Already in Chapter 2 we have noted an extremal result established by Ramsey. 
Results of a similar nature are nowadays formulated in a separate branch, which 
quite often is called Ramsey theory. The problem theme in it can, in essence, be 
reduced to the two following problems. 

If a large structure is partitioned into disjoint parts, then how many substructures 
are bound to belong to a single part? Conversely: How rich can the large structure 
be if any partition of it contains a part of a prescribed nature? The basic initial 
result of this type, which is also the basic instrument for solving similar problems, 
is the Dirichlet principle. 

We illustrate this by the standard examples of Ramsean kind. 

e Among any three persons there are two of the same sex. 
e Among any six people there is either a triple of pairwise acquaintances or a 
triple of pairwise strangers. 


While the first example is a simple reformulation of the box principle, the second 
is a particular case of Ramsey’s theorem. Consider a proof of this example. For any 
person in this sextet of persons there are p acquaintances and q strangers among 
the remaining five, so that p and q take the values 0,1,...,5, with the condition 
p +q= 5. It is obvious that max(p,q) > 3; for the sake of being specific we assume 
that p > q. Consider these p acquaintances. If there is at least one pair of mutual 
acquaintances among them, we have found a triple of acquainted persons; if there 
is no pair of acquaintances among these p persons, then we have a triple of pairwise 
strangers, since p > 3. This proves the required assertion. 

We must note that the amount of six persons is extremal; in fact, it is the smallest 
number with the above property, since there is a selection of five persons not having 
this property, i.e. in which there is no triple of pairwise acquainted persons nor a 
triple of pairwise strangers. This results from the intransitivity of ‘being acquainted’, 
regarded as a binary relation, and the obvious fact that the complete five-vertex 
graph Ks can be represented as the union of two disjoint pentagons: Ks — Cs + 
Cs, where the first cycle is the graph of pairwise acquaintances and the second 
that of pairwise strangers. This suggests that the relation ‘being acquainted—being 
strangers’ can be adequately represented by coloring the edges of the complete graph 
with two distinct colors, e.g. pairs of acquaintances with red and pairs of strangers 
by blue. Ramsey’s theorem can be conveniently stated precisely in terms of edge 
colorings of graphs. 
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THEOREM 3.7 (RAMSEY THEOREM (PARTICULAR CASE)). For natural numbers 
r and s there is a smallest integer R = R(r,s) such that in any full edge coloring 
of the complete R-verter graph Kp with two colors (with each edge colored by one 
of these two colors) there is either a complete subgraph K, of the first color or a 
complete subgraph K, of the second color. 


Thus, our example establishes that R(3,3) = 6. In general, the exact computation 
of such Ramsey numbers for graphs is a difficult and by far unsolved problem. The 
values of R(r,s) for even the most natural parameter values are far from being 
known completely, as has been indicated in Table 3.1, which gives all presently 
known information concerning the exact values of the Ramsey numbers for graphs 
when coloring with two colors. 


Table 3.1. Estimates and exact values of the Ramsey numbers R(r, s) 


We have taken this table from [{206].? 

Ramsey numbers may be computed not only for ordinary graphs, but also for 
l-graphs, and, moreover, more than two colors may be used. 

By analogy with this result we may consider the packability of number partitions 
in terms of colorings of 1-graphs or 1-subsets, and it is therefore natural to extend 
this packability to /-graphs and to compare the corresponding extremal bounds 
with Ramsey numbers. Let C(r,s) be the smallest integer C' such that in any edge 
coloring of the complete C-vertex graph Kc with two colors there are edge-disjoint 
subgraphs K, and K, that are each of a single color. The following fact confirms 
that the difference between C' and R for 2-subsets already is less striking than for 
1-subsets, but that, on the other hand, these numbers coincide in essence. 


PROPOSITION 3.1. [fr > s, then C(r,s) = R(r,r). 


2On March 19, 1993 we received the following by e-mail from B.D. McKay and S.P. Radziszowski: ‘We have 
proved that R(4,5) = 25. Our proof is computational. For integers s,¢ define an (s,t,n)-graph to be an n-vertex 
graph with no clique of order s or independent set of order ¢. Suppose that G is a (4, 5, 25)-graph, with 25 vertices. 
If a vertex is removed from G, a (4,5, 24)-graph H results; moreover, the structure of H can be somewhat restricted 
by choosing which vertex of G to remove. Our proof consists of constructing all such structure-restricted (4, 5, 24)- 
graphs and showing that none of them extends to a (4,5, 25)-graph. To reduce the chance of computational error, 
the entire computation was done in duplicate using independent programs written by each author. The fastest of 
the two computations required about 3.2 years of CPU time on Sun workstations.’ 
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PROOF. Clearly, C(r,s) > R(r,r) always, and if R(r,r) —r+1 > R(s,s), then 
C(r,s) = R(r,r). In fact, put R = R(r,r) and consider an arbitrary coloring of 
the complete R-vertex graph Kp with two colors. This coloring must include a 
one-colored graph K,, say on the vertices |r]. Consider now the coloring on the 
remaining vertices [r, R] = {r+ 1,..., R}. Since |[r, R]| = R-r+1 > R(s,s), this 
coloring contains a complete s-vertex graph K,, which can intersect the graph K, 
in at most one vertex. Hence we have obtained the required configuration. 

We now show that if r > s, then R(r,r) —r+1 > R(s,s) always. For this it 
suffices to give a coloring of K ps,5)+r—-2 With two colors, not containing a one-colored 
complete graph on r vertices. Represent K p/s,s)4r—2 aS a sum of disjoint subgraphs: 


K R(s,8)+r—2 = K R(8,8)—1 at Ky-1 a K R(6,8)—1,r—1; 


where the edges of KA(s,s)-1 are colored with two colors such that this graph does 
not contain a one-colored complete subgraph on s vertices, all edges of K,_, are 
colored blue, and all edges of Krxs,s)-1,,-1 are colored red. It is obvious that with 
this coloring the graph K p;s,s)+r-2 does not contain a one-colored complete subgraph 
on r vertices. 


Not only hypergraphs can be taken as structures to be partitioned, but also to- 
tally different sets, e.g. of numerical or geometrical objects. The earliest number- 
theoretical facts in Ramsey theory are the following theorems. 


THEOREM 3.8 (SCHUR’S THEOREM). In any partition of the set of natural num- 
bers in finitely many parts, there is a part containing numbers x,y,z such that 
Zty =z. 


THEOREM 3.9 (VAN DER WAERDEN THEOREM). In any partition of the set of 
natural numbers into two parts, there 1s a part containing an arithmetical progression 
of | terms, a,a+b,...,a+(l—1)b, whatever the given finite length 1. 


The extremal form of van der Waerden’s theorem is the problem of computing 
W(n), the smallest integer W such that in any partition of the set of the first W 
natural numbers, |W] = {1,...,W}, into two parts, there is a part containing an 
n-term arithmetical progression. This is an open problem, as is the computation of 
Ramsey numbers for graphs. 

The best upper bound for W(n) is due to S. Selah [205]. 

Geometrical facts of Ramsean type emerged somewhat later. The earliest of them 
has been considered by Erdds and Szekeres, in the form of the following extremal 
problem posed by E. Klein. 

Compute NV (n), i.e. the smallest integer NV such that we can choose from N points 
in general position in the plane (i.e. no three lying on the same line) n points forming 
the vertices of a convex n-gon. 

In particular, it has been proved that 


2 = 
rte Min) s (™ ea 
n—2 


and it has been conjectured that the lower bound is the exact value. This conjecture 
has been confirmed only for n = 3,4,5. It was precisely the estimation of N(n) that 
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led Erdés to the re-discovery of Ramsey’s theorem, somewhat later than Ramsey 
himself, however. 

In specific forms of Ramsey statements one considers, next to incidence relations, 
also special relations: in number-theoretical ones additive relations, and in geo- 
metrical ones configuration and metrical relations. The following is an example of 
calculation with metrical relations. 

For any coloring of the points of the plane by three colors there is a pair of points 
of the same color having distance one. The problem on the smallest number of colors 
for which this property does not hold is open; it is only known that this number lies 
in between 4 and 7. 

In the course of solving the problem of Klein on convex polygons, an extremal 
problem concerning permutations emerged (see Problem 3.25). 

Nowadays, combinatorial permutation problems form a separate direction, con- 
sisting both of purely combinatorial ‘permutational’ statements, and of problems 
connected with group properties of permutations. This theme deserves special con- 
sideration, which would lead outside the scope of this book. We restrict ourselves 
to given some statements in the form of problems. 


6. Problems and assertions 


PROBLEM 3.1. Suppose we are given a hypergraph F' C P(S,,) and a system of 
hypergraphs W = {G,...}, G C P(S,), on the vertex set S,. Moreover, suppose 
we are given a binary relation R C P?(S,) on the Boolean P(S,). For G C P(S,) 
we denote by R(G) the complete image of G under R, ie. R(G) = {X € P(S,): 
de € G: eRX}, and for S C S, we introduce the quantity 


degy(S) =|{G Ew: Se R(G)}|. 
Then 
>= degy(S) = S25 |R(G) Fl. 
GEW 


SEF 


PROBLEM 3.2. If G is a hypergraph and S a vertex set, then 


v(Sp,p;G)\ ISM es, ) 
pa k 7 2X ( p 


1<ig<:-<ip<[G] 


eee ae ey (" — ) 


S,CS e€G q P—4q 


PROBLEM 3.3. Compute the sum 


u(Sp, 95 “) 


PROBLEM 3.4. If a graph Hy consists of t connectedness components with | ver- 
tices in each component and k — lt isolated vertices, then 
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PROBLEM 3.5. If the graph H, does not have isolated vertices, then 
ne 


PROBLEM 3.6. Forbidden subgraphs. 
{HeC,) = eS +1; the extremal construct is the complete (n — 1)-vertex graph 
Kn-1, complemented by an edge incident to the nth vertex; 


F(n; {C3,Ca,...}) =n—1; 
pmcn= (7S *)+ (757), l<n< 2-3; 


2 
Gh n < 2l, 
f(nsCan)=42)+ (3), Wsnsal-1, 
Ea ; n> 41-1; 


n(il—1) r(l—-r) 
nn” ae 
n—1l 


f(n; {Ca, Ce, Cs,...}) =n —1+ P|. 


f(n; (Ci, Coga,. +, Cr}) = n=qg(l—-2)+r, 0<r<l-2; 


In the last case the extremal construct is the graph C!(a)C1(S, \ {a}) + F(Sp \ {a}). 
PROBLEM 3.7. Forbidden subgraphs: the ‘degenerate’ case. 
fc) scons 
f(n; C4) = 0.5n'* + O(n); 
f(n; {Ca,Cs}) = (0.5n)" + O(n); 
q We< f(n; K33) < ce: ne? 


PROBLEM 3.8. (?) The Erdés—S6s conjecture. For every k-vertex tree T, we 
have f(n; 7) < n(k — 2)/2. 


PROBLEM 3.9. If n > k > 2 and k is even, then every n-vertex graph without 
isolated vertices contains a proper k-vertex subgraph without isolated vertices. 


PROBLEM 3.10. Prove the following equation: 
k-1 nS k 
. P, — > 
m(n; Pe) i mpw kth) ifn>k. 


PROBLEM 3.11. Let M(n,k) be the minimum number of edges in a graph G,, C 
C?(S,,) such that 


VS~ C Spda € Sp \ Se: C'(a) OES) C Gx: 
Then 


mn, k) = (k—1)n— & + a re 
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and the extremal construct has the form 
O(Sa) = CO (Sa) EF Spe): 


PROBLEM 3.12. Let E(n,t) be the maximum number of edges in an n-vertex 
graph with number of independent edges at most t. Then 


ar: 1, Cee Seren 


Sas) iim S Bie. 


PROBLEM 3.13. If aj(G;) is the largest number of disjoint independent vertex 
[-sets in a graph Gy, then for any / such that 2 <1 <b+1 we have 


R=ta(G\= J is 


(Gi) > | ee 


PROBLEM 3.14. If 6(G;) is the smallest degree in a graph G,, then 


k-1 
x(Gk) = aol ad 


PROBLEM 3.15. The maximum number of k-edges in an n-vertex k-graph GE C 
C*(S,,) not containing a triple of k-edges A, B,C € G*® such that AAB C C is, for 


k = 2,3,4, equal to 
B ee] n+k-1 
ell pole ee 


PROBLEM 3.16. The minimal total number of triangles in an n-vertex graph and 


its complement is 
n\ |n (" —1 ) : 
3 2 2 


Hint. Show that the total number of triangles in and n-vertex graph and its 
complement can be expressed by 


(2) EC 2 “)-()) 


where dj,...,d,, are the vertex degrees of the graph. Thus, one has to minimize this 
expression, taking into account that the sum of all degrees in the graph is even. 


PROBLEM 3.17. The edge chromatic number of a graph G is defined as the small- 
est integer X for which there is an edge coloring of this graph with X colors such that 
any two adjacent edges have different colors. Show that if A is the maximal degree 
of G and t is the largest number of independent edges in G, then A< X <A +1, 
while if |G| > At, then X = A +1, andif A> 2t+1, then X =A. 


PROBLEM 3.18. How many edges must an n-vertex graph have if between any k 
vertices there are t independent edges? 


PROBLEM 3.19. (?) Try to compute p(n, Kp + K,). 
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PROBLEM 3.20. a) We call a graph countably-covering-connected if for any pair 
of vertices in it there is a system of countably many vertex-disjoint paths joining 
these vertices and covering all vertices of the graph. Show that a plane graph is 
Hamiltonian if and only if it is countably-covering-connected. 

Hint. Use a sufficient condition for being Hamiltonian, due to Tait: a plane 
four-connected graph is Hamiltonian. 

b) (?) Is the previous criterion for being Hamiltonian true for nonplanar graphs? 


PROBLEM 3.21. Let ng be the smallest n such that for any edge coloring of the 
complete graph K,, with two colors there are two one-color triangles, possibly not 
of the same color, but without common edges. Then ng = 7. 


PROBLEM 3.22. Let np be the smallest n such that for any edge coloring of the 
complete graph K,, with two colors there are two one-color triangles of the same 
color and without common edges. Then ny = 8. 


PROBLEM 3.23. Show that for the Ramsey numbers the following inequality holds: 
R((r —1)(s —1) + 1,(r — 1)(s — 1) + 1) > (Rr, s) — 1)(R(s, s) — 1). 


Hint. Consider K pi,,7)-1, color it with two colors such that it does not contain a 
one-color subgraph on r vertices in which each ‘vertex’ is a complete graph K As,5)-1, 
colored with two colors such that it does not contain a one-color subgraph on s 
vertices. 


PROBLEM 3.24. Let m(n,2,3) be the maximum number of hyperedges in n n- 
vertex hypergraph such that each two hyperedges have a nonempty intersection, 
while each tree hyperedges have empty intersection. Then 


m(n, 2,3) = [1 + (8n + 1)°%9]. 
analyze the relation between m and the Turan numbers 7'(n, k,/). 


PROBLEM 3.25. How many terms can a maximal monotone subsequence in a 
rearrangement of the first n natural number have? 


PROBLEM 3.26. What is the average (in the set of all rearrangements) length of a 
maximal monotone subsequence in a rearrangement of the first n natural numbers? 


PROBLEM 3.27. Universal key for naming files. Assume that the unknown name 
of a file consists of n different symbols. How short can a subsequence of n symbols, 
in which all possible rearrangements of these n symbols are present as subsequences 
from subseries of distributed elements of the sequence be? 


PROBLEM 3.28. Universal ruler. How many lines, N(n), should be arranged on 
a ruler such that precisely any integer from 0 to n can be measured by these lines? 


PROBLEM 3.29. Local Ramsey property. Let H;, be an arbitrary k-vertex graph 
with at least one edge, and let m > 2 be a natural number. Show that there is a 
smallest natural number R = LR(Hy,m) (R > k) such that for any R-vertex graph 
Gr one of the two following conditions holds: 


4G; C Gr: Ay S Gr or cl(Gr) > m, 
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where cl(G) is the clique number of the graph G, i.e. the number of vertices of the 
largest complete subgraph in G. 

Prove the following equations 

(1) LR(K2 U (k — 2)K1,m) = R(k, m); 

(2) LR(P3 U (k — 3)Ky,m) = max(k, R( Ky — Fe, m); 

(3) suppose Ky ,-1 C Hx; then 

PRG = k — cl(Hy) + max{m, cl(Ax)}, Fr FZ (Ky — Ax), 

max{k,m + max{m— 1,k — cl(H,)}}, so (Ky, — Hx) 


(use Theorem 3.1); 
(4) LR(Ka3, m) = 3m-— 2 (m > 3) 
(concerning the solution of this problem see [59]). 

PROBLEM 3.30. Let R(G,H) be the smallest R for which any edge 2-coloring of 
the complete graph KR has either a subgraph G of the first color or a subgraph H 
of the second color. (The existence of R(G, H) follows from Ramsey’s theorem.) 

If nG denotes n disjoint copies of a graph G and n > 2, then 


R(nK3) = R(nK3,nK3) = 5n. 
If D is the four-vertex graph consisting of K3 and an adjoined edge, and n > 2, 
then R(nD) = 6n. 


CHAPTER 4 


Extremal geometrical problems 


This Chapter is devoted to yet another direction of extremal problems: concerning 
discrete families of geometrical objects. As an application we study the connection 
with extremal problems on hypergraphs, and also give some applications in matrix 
algebra. 


1. Linear normed spaces 


A nonempty set X is called a linear space over the set of real numbers R if an 
addition operation is given on X, with respect to which X is closed and has a neutral 
element 0, and if there is defined multiplication of elements of X by real numbers 
(from R), with as result elements of X. In addition, the following conditions must 
hold: Va,3€R,2,y € X: a(x) = (aB)x; (a+ B)x = ar+GBr; a(xt+y) = axrtay. 

A linear space is also called a vector space, and its elements x € X are also called 
points or vectors. Instead of over the set of real numbers R, we may consider a 
linear space over the set of complex numbers C (or, in general, over an arbitrary 
field P). 

A system of vectors 21, %2,...,%4 € X of a linear space X is said to be linearly 
dependent if there are numbers ay, Q2,...,@q € R, not all zero, for which a,x, + 
Qglo +--+ + AgXq = 0. In the opposite case, i.e. such a1, Q2,...,@q € R do not 
exist, this system of vectors is said to be linearly independent. We say that the space 
X has dimension d if it contains a system of d linearly independent vectors, while 
every system of d+ 1 vectors is linearly dependent. If there is a linearly independent 
system of d vectors for arbitrarily large d, the space is said to be infinite dimensional. 

A linear space X is said to be normed if to each vector x € X there corresponds a 
number ||z||, called the norm of this vector, such that the following conditions hold 
fora, GER, z,yEe xX: 


\|x|| > 0, x0 (nonnegativity), 
||ax|| = |a|- [|x|] (homogeneity), 
lz + yl] < [lel] + llyll (convextity, or the triangle inequality). 
A subset Y of a normed linear space X is called a subspace if it is itself a normed 
linear space with respect to the operations of addition of vectors and multiplication 


by scalars used in X. In other words, Y C X is asubspace of X ifz,yEeY,a,BeR 
imply ax + By € Y. The norm defined in X is also a norm in Y. 
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We consider some examples of linear normed spaces. Let p > 1. We denote by 4 
the space whose points are sequences of numbers x = (21, £2,...) for which 


fore) 
S- |x|? < O&O. 
t=1 


The norm of such a sequence in €, is defined as the number 


fore) 1/p 
fei (3 lar | 
i=l 


The triangle inequality for this norm is ensured by the condition p > 1. A space 
é, may be infinite dimensional, and may be finite dimensional, of dimension d. In 
the latter case its points are number sequences of length d (d-dimensional vectors), 
and for them the requirement that the sum of powers of their constituents is finite 
is clearly fulfilled. 

A limit case of the spaces 4, is the space of bounded number sequences with norm 


|x|] = sup [zi]. 
u 


We denote this space by é.. 

The infinite-dimensional space £2 is called the Hilbert space. A space lz of dimen- 
sion d < oo is d-dimensional euclidean space, and is also denoted by R?. For any 
two vectors x2, y € £2 we have the equality 


lz + yll? + lle — yll? = 2(la|/? + llyll?), 


called the parallelogram rule, or parallelogram law. For € with p # 2 this rule is 
not satisfied, in general, but we have a chain of inequalities: 


gmin(LP—I}(\ |? + lly?) < [lar + yl? + lar — yl? < 20-9 (lla + [lylP)- 


The space @2 has the very essential feature that in it a scalar product can be 
defined: for x, y € 2, 


(x,y) = G(llz + yll? + IIx — yll?). 
lt ee (Gia = ay toe ss )y Len 
(2,9) ad Driv; 


and (x, x) = |x|, (x + z,y) = (wy) + (2,9). 

The unit sphere in a normed linear space X is the set of all x € X satisfying 
|z|| = 1. The norm of each normed linear space is uniquely determined by the 
shape of its unit sphere. For example, in the two-dimensional case, the unit sphere 
in 2 is the circle of radius one; the unit sphere in @; is the square with vertices 
(0,1), (1,0), (0,-—1), (—1,0); and the unit sphere in @,, is the square with vertices 
(1,1), @,-1), (-1,1), (—1,-1). In the three-dimensional case, the unit sphere 
in €2 is the ordinary three-dimensional sphere of radius one; the unit sphere in @; 
is the tetrahedron with vertices at (0,0,1), (0,1,0), (1,0,0), (—1,0,0), (0,—1,0), 
(0,0,—1); and the unit sphere in @,, is the cube with vertices at (1,1,1), (1,1,-—1), 
(1, als Dy (1, c= =I), ok L 1); tal, 1, =); (=1, =k 1), (ls =I, Ls 
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We denote by o = {x1,2%2,...} an unordered set (system) of vectors x; € X. 
When necessary, we indicate the number n of vectors in o by a subscript: on. The 
system o may contain identical vectors, therefore the notation 0, C Om, forn <m, 


means that if om = {@1,.-.,%m}, then on = {2i,,...,2i,}, where 1 < i) < +--+ < 
ln < m. Hence there are i possible choices of such n-subsystems. Sometimes we 
impose metrical restrictions on a system a, of the type: o = {21,%2,...: ||zi|| > 1, 


2 = 1,2,...}. These are always explicitly mentioned. We put (0) = Sze, x and 
loll = I(o) I] = Il Sree all. 
We note another useful identity. If k >1 > t and o; C ox, then 


De in) = (0 (07)) + mod ()) 4). (4.1) 


01 [01 Cor 
In fact, 
 Q= > 285 
o1Co,Co, o1Co, Coz FEO} 

= + Ss waeenl= “ae SS xivceaql= 
o1Co Co, FECk LEOK = 0,CoiCox 

=Sizc dX) xfteants+ Yor YX x{xEea}= 
rEort a1Ca,Co, LECK—9t = a1Ca, Co, 


= a7] + (O% -od(7-) 1): 


2. Extremal geometric constants 


The theme of extremal geometric constants includes both the calculation of ex- 
tremal numerical characteristics of systems of vectors and the spatial description of 
systems of vectors that are extremal with respect to some property. 

We will now compute several concrete extremal geometric constants. 


CONSTANT A. Let A(k,!; X) be the largest number A for which 
Vopr CX Ao, C ox: |lou|| > Allon — oll. 
If k > 21, then 


l 
A(k, l; X) = ——; AD 
(bX) = (4.2) 
if k < 21, then 
l 
:X) = A(k, 1; €0) = ——. 4. 
inf A(k, 1; X) A(k, l; €.0) 31k (4.3) 
PROOF. Let oj C ox be a subsystem such that 
max ||ou|| = loill- 


Then for k > 2/1 we have by (4.1), 


>, (01) = (0% — 9) (" — i; 


a CoK—-9; 
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Taking in this identity norms on both sides and using the triangle inequality, we 
obtain 


l 
lloull = Glee — all: (4.4) 


If k < 21, then we have by (4.1), 
YS (a) = (0% — %) + (03) are 
Puy NORE SEV Opadey ore 
On—-9, Co Con 


and transition to norms gives 


i 
lloill = sp —zllee — all: (4.5) 


The fact that (4.4) and (4.5) cannot be improved upon in the class of all normed 
linear spaces is demonstrated by the following construct (k,l; &&,) of k unit vectors 
in 05: 

L1 = (=I, (21 a Deore v9 , (21 7 Ly); 
L2 = ((2/ _ Omer a ny (21 a Ly) 
a = ((21—1)7!, (21 —1)7?,..., -1). 


Moreover, in each X equality in (4.4) is realized by a system of k vectors, which 
proves (4.2). O 


CONSTANT B. Let B(k,r,!; X) be the largest number B for which 
Vor C X Vopr C o~ Jor C on: |lou|| > Bllor ||. 


Then with the same methods it is easy to prove that 


= eae ead al 
Bik rt Xx) = ie k>l+r> 2r, 

U(k—l) 

r(k}l—2r? k<l+r, Lr, 


and there is a one-dimensional construct realizing these values. 


PROooF. By (4.1), ifk >r>1>1ando, C ox, then 
—] 
(a1) = — - (Or) i= ), 
2s, 21 
hence we have 


L 
max |loil| 2 Ello, 


which clearly cannot be improved upon, as demonstrated by the pencil of k unit 
vectors. 
Let k >l +r > 2r, on C ox. By (4.1) we have 


k-r-1 k-r-1 k-—1\{k-r-1 
( beat » (a) = (F277 d, a) =(on(7,) sl ) 


or Go Cor o1CoK—Or 
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This implies that ||oy|| > /||o,||/(21 — r). Both these estimates cannot be improved 
upon in the class of unit vectors. It suffices to take for o, the system U(k, l; €5,); 
moreover, the last inequality cannot be improved upon for all X. 

Let k<l+r,l> 1, 0 C ox. Then by (4.1) we have 


(tae) 2 - (Ga), 


or CoCo. On—Fr Gal Co, 


ON WM eterescil te east) Gos ou) 


whence ||o;|| > l(k — 1) |lo,||/r(k +1 — 2r). This estimate cannot be improved upon 
for all X. O 


CONSTANT C. Let C(k,r,l; X) be the largest number C' for which 
Vor CX: SO flor] >C Y= |laill. 


OrCo,K o;CO,K 
Using the recurrence 
i-1 
Chi Xx). > [] Ck, 4,2 +1; X) 


and the triangle inequality, we find 


A system of k distinct vectors is an extremal construct in this case. 

In general, if a system of vectors o is free of any metrical restrictions, then the 
problem of computing the extremal geometric constant is essentially one-dimensional. 
In particular, this is demonstrated by the computations of the constants above. The 
situation changes if we introduce restrictions on the lengths of vectors. 


CONSTANT delta. Let 6(1,k; X) be the largest number 6 for which 
VO {Giese ©, (Gall 2 1 Sse RVC OX See es |lay|| 2. 
In other words, we must compute the constant 
6(l,k; X) = inf, max ||oq||. 
OnCX FCOK 
lei}2>1 


THEOREM 4.1. Let X be a normed linear space, H a Hilbert space, and R? d- 
dimensional euclidean space. Then the following formulas hold for 6(1,k; X): 


inf 6(1, k; X) = 6(l, k; lo) = 54 kd: (4.6) 


2l—1° 


66+ 1;2,;)=—— dim, >14 1; (4.7) 


2l—1°? 


R-2 imé, > 
VES) SORE) = , k even, dimé, > d(k), (4.8) 


k odd, dim ¢, > d(k): 


142 4. EXTREMAL GEOMETRICAL PROBLEMS 


moreover, d(2) = 1, d(3) = d(4) = 3, d(5) = d(6) = 10, d(7) = d(8) = 7; 


(2 RHO. d+2<k< 2d: (4.9) 
sup 6(1,l +1; X) = 6(1,1 + 1; 42); (4.10) 
X:dim X =oo 
0.5 
6(1,k; H) = (ao) ,  dimH>k-1; (4.11) 


moreover, (4.11) is realized by the regular k-verter simpler inscribed in the unit 
sphere of R*~!; 


otk — 1, ke) = 1, dim H > 2, (4.12) 


6(2,k;R?) = 2cos a (4.13) 
moreover, (4.12) and (4.13) are realized by the system of vectors in the plane forming 
the vertices of the regular k-gon inscribed in the unit circle; 


0.5 
(5 + 4 cos 24) . eo 
6(3, kj R?) = 9 148°" k=5 (4.14) 
0, k= 3 


if dim X = 1, then 


l k>2-1 
6(3,k;R?)=(k—-l, k<2l—1, k even, (4.15) 
i) kell, kodd 


1 


We prove some of these formulas, marking, for convenience, the proof of these by 
their sequence numbers. 

Put k =1+1 in the A, then (4.6) follows immediately from (4.3). The definitions 
of B and delta imply the inequality 


6(1, k; X) > Blk, 1,1; X)6(r, kX), 


which for r = 1, k > / also implies (4.6). Moreover, (4.6) is a direct consequence of 
Theorem 4.2 below. 

To prove (4.7) it suffices to exhibit a system of vectors in @{t! with norm of a sum 
of any ! of these equal to the righthand side of (4.7). Clearly, the following is such 
a system: 


Pe eee 1) 

- ee a 
ili sl) 
ae ea 

wn tly mea a 4 | 
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since 
141 


S52; — Zi 
jel 


for it. 

(4.15). The case k > 21 — 1 is trivial. Let k < 2/ — 1. Consider k numbers, with 
absolute values not less than |, and, for the sake of being specific, of which qg are 
negative and p positive: p+q =~k,p>q, {%1,.--,%p,-Yiy--  — Yeh, lS ar <--- < 
Lp ly < ++: < yg. If p > l or q = I, then there are / numbers of the same 
parity, and the absolute value of their sum is not less than /. Therefore we assume 
that p,q < Jl. It can be easily seen that the maximal absolute value of the sum of | 
distinct numbers from the k given numbers is taken by one of the following sums: 


Oy a Re ee Ee ap a2 = Yi st Ug Ly = Leg: 
However, 


qa; + (l—p)ag 
l—p+q 


,, |(pq— plan] J UE) 


max(lar|;lazl) > ee 


It remains to note that the fraction l[(k — l)/(l — p + q) takes its smallest value for 
p—q = Oifk is even, and for p— q = 1 if k is odd. 

The constant 6 can be viewed from a somewhat different perspective, allowing us 
to introduce and study some of its useful modifications. More precisely, with each 
concrete subsystem of vectors 0; C ox we may associate the sum 


k 
> Ci Li, 
i=1 


where ¢; = x{X; € a1}. So, we may associate to each selection of a concrete subsys- 
tem of vectors o, C o, a rearrangement of k numbers ¢;, among which there are | 
ones and k —1/ zeros. Hence, selecting a maximizing system o; C ox is equivalent to 
mastering all rearrangements of these numbers, i.e. 


k 
S Cn (i) Li 


6(l,k;X) = inf, max 
onCXxX TES, 
|zal]>1 


> 


where S; is the symmetric group of all permutations on the set [k] = {1,2,...,k}. 

It is now clear that this mastering over all rearrangements can be done not only 
with vectorial coefficients, consisting of ones and zeros, but with arbitrary numerical 
vectors. Thus, we consider the following constant: 


> Wri) Li 


6W;X)= > max 
aex.” 
re]21 


where W = (w1,..., Wx) is a fixed numerical vector. Hence, if W consists of | ones 
and k — l zeros, the latter two constants coincide. 
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The constant 6(W; X) can be naturally called a weighted geometric constant. We 
put 


k 
(W, Ty Ok) = S- Wr(i) Ti; 
gai 
and consider the functional 
||W, o%|| = max ||(W, 7, ox) ||, 


where the maximum is taken over all rearrangements 7 € Sx. This functional is 
invariant under rearrangements of the components of W, hence in the sequel we 
assume that w, > --- > we, and w = w, +--: + we. The dimension of W is 
sometimes indicated by a subscript. The values of the weighted constant in the 
case of a degenerate vector, i.e. when there is a single component or no component 
at all, clearly are: 6(W,;X) = |w|, 6(0;X) = 0. Hence in the sequel we consider 
only nondegenerate vectors. In general, the dependence on the shape of the weight 
vector is essential. For example, below we convince ourselves that a nontrivial part 
of the problem of computing the weighted constant is related to balanced vectors, 
i.e. those W for which for every 7 we have w(w— w;) > 0. The least possible value 
of the weighted constant is given by: 


THEOREM 4.2. Let W,, be a nondegenerate weight vector. Then 


inf 6(W; X) = (Wa; foe) = Leen = wn) (4.16) 


fw wi] + fw — wal 


PROOF. Let 7 ,...,7, be rearrangements of the numbers {1,...,n}, and aj,..., ax 
real numbers. Then 


k k 
> lasl|Wn, onll 2d 7 lasl|W, 73, onl] 2 


So(Wa, Nj, On) 


a; 


’ 


so that 
(jw — wy| + |w — Wnl)||Wr, On| = 


5 ee = a) Sine a1 Was On) = (w= Wi) Ee wtaen(Way On) || 

= (n— 1)! 7 

7 = wW-W— Ve. Wie W-— Un — . 
= lew Wn) (wie + ae ae) (w — wy) (wa + mt ya) 


= |w(w, — Wn) |||21|| >= |w(w; — wWp)]- 


Thus, the lower bound in (4.16) has been proved. 
If the weight vector is not balanced, then 
|w(wy a Wn)| _ 
ie ees 
Jw — wi| + |w — wr 
and (4.16) follows from the general upper bound 6(W; X) < |w|, which follows for 
a system of n distinct vectors. 
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If the weight vector is balanced, then 


|w(wi = Wn)| ww — wn) 


lw —wi| t+ lw— wal 2w—w) — wv, 
Put 
W, + Wn 
a= —————, 
2wW — W) — Wn 


and consider in the space @,, the system %, of n vectors of the form 


x, = (—l,a, ,a), 
Lo = (a, 1 ,a), 
Zn == (a,...,a,—1). 


Since W is balanced, we have |w| > |w, + w,|, hence all vectors in X,, have norm 
one. Moreover, for any rearrangement 7, 


(Was m, Xn)I| = max |(w — wpicg))a — Wars)|; 


and because the function |(w — z)a — z| is convex in z, we have 


w(w) — Wn) 


2w — WwW) — Wrl i 


[Wa 2a) | = tmax(|(w— ws a (w=) = 
In the sequel it is convenient to introduce a special kind of vector W, namely: W 
has dimension m and m— n zero components. We denote the final variant by 


(WA) SOWA) inf . max |Win Onl. 


A bound resembling the bound for the balanced case can be given also in the 
Hilbert situation. 


THEOREM 4.3. Let H be a Hilbert space of dimension at least m — 1, and let 


So ww; > 0. 
t#3 
Then 
n aa 2 0.5 
5(Win,n; H) = (Seo) (4.17) 


If dim H > 1 and 
So ww; — 0, 


then 
(Wn, n; 1) = |w. 
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PROOF. For 
Greetings pts lee tae Aaa ee (4.18) 
we put 5; = i) (n — i)}, 
A= y: So(Wa, 7, On)”, 
OnCOm 7 


where squaring is understood in the sense of the scalar product, and the inner 
summation is over all n-rearrangements. Then we have 


A= by yw, yet =f 2b, Si < J WW; So xix; = 
i=1 i=] 


i<j 


Tr m 2 n n m 
= be S> ww; eS n) ae q rs — bo Sw yx, 
1 


i<j i=l i<j i=l 


so that we have 


| A| =m (Sout —& Swe] if S> ww; > 0, 
i=l 


i<j 


|A| > m?be > ww; +m (s So wt — Sow if So wiwj <0. 


i>j i=] t>Jj 


Since the sum A contains (™)n! summands, there is at least one such summand that 
is not less than the average of the two bounds given above, so that by elementary 
calculations (4.17) and (4.18) as lower bounds follow. Taking this into account, the 
fact that 6(W;X) < |w| immediately implies (4.18). To prove (4.17) it suffices to 
take as extremal construct the system of m vectors in R™~! that are the vertices 
of a regular simplex ©,(R™~') inscribed in the unit sphere of R™~!. For this 
construct all summands in A are equal, and ||©,(R™~')|| = 0. So, equality holds in 
all estimates given above, and (4.17) is proved. O 


There is another modification of the constant 6 which is useful. Here we give esti- 
mates of its value without proofs, since the proofs differ but slightly from those for 
the corresponding estimates for 6: 

n 


> Wr (i) Li 


6y(W;X) = inf max 


’ 


OnCX TESn | 

ilrel}>1 - 

t=1,...,7r 
where W = (wi,..-, Wp) is a fixed numerical vector and S,, is the set. of rearrange- 
ments of the indices 1,2,...,n. 

We put Wmax = Max(wy,.--,Wnr), Wmin = Min(wy,..., Wn), 
|w(Wmax i Wmin)| 
é(W) = 


- |w a Wriic| + Jw = Wmin| 
and consider nondegenerate vectors only, ie. W #0 and n > 2. 


THEOREM 4.4. The following formulas hold for 6,: 
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for everyr <n, 


inf 6.(W; X) = 6(W); 


moreover, if W is not balanced, then VX: 6,(W; X) == 6(W) = |wl, and if W 
is balanced and X contains a subspace isometric to €,, then 


6.(W; X) = 6(W); 
for any space X and vector W € R”, 
6(W; X) = 6(W); 
(ay (ne, w? — w)) if Cicicjcn Wiw; > 0, 
lw (s)” if Li<icj<n wiv; <0. 


Contact numbers.The constant 6 is tightly related to the Newton—-Gregory con- 
tact numbers (the largest number of unit spheres that simultaneously can touch a 
central unit sphere). Indeed, if we consider the extremal constant 


5,.(W; &&) > 


1<i<j<n 
[ze l|=1 


d(k;X)= sup min_ |x; — z;|l, 
O,nCx 


then the contact number k(X) (for a space X ) can be defined as the largest integer k 
satisfying the inequality d(k; X) > 1. If X is a Hilbert pace, then the parallelogram 
law holds in it: the sum of the squares of the sides of a parallelogram is equal 
to the sum of the squares of its diagonals. Hence, if X is a Hilbert space, then 
d?(k; X) + 6?(2,k; X) = 4, and hence the contact number k(X) is the largest integer 
k satisfying 6?(2,k;X) < 3. 

In a euclidean space the exact values of contact numbers have been computed for 
a few dimensions only: k(R?) = 6, k(R*) = 12, k(R®) = 240, k(R™4) = 196560. 
The dependence of the values of contact numbers on the norm of the space (in other 
words, on the shape of the unit sphere in it) is very important. It is easy to construct 
an example of a space for which the contact number in dimension three equals 26. 
Clearly, this value is realized in the three-dimensional space whose unit sphere has 
the shape of the ordinary three-dimensional cube, i.e. in £,,. In this case the system 
of unit spheres realizing the contact number 26 is represented by three layers of unit 
cubes, with nine cubes in each layer (located precisely as the smaller cubes are in 
Rubik’s cube). In this case the central (invisible) cube touches all other cubes, while 
it touches certain cubes along faces, others along edges, and still others at vertices 
only. But let us return to euclidean space. Let Ma(r) be the largest number of unit 
vectors in R4 whose pairwise distances are at least r, and let Na(s) be the largest 
integer N for which there are vectors 21,...,2y € R¢ satisfying the conditions 


(es ee) =, ee ed: 
(24,23) <8, tf j. 


If s = 1—r?/2, then Ma(r) = Na(s), and Na(1/2) = k(R4). 
To estimate Na(s) we use the method of spherical polynomials. 
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Consider the polynomials Q(t) recursively defined by the relations 
ay =, seams = di, 


(d) ~ d+ 2k — (2) , trees k- (a) - 
There is a map 
k+d—2 k+d-3 
(@). pd_, Re = 
flO: BIR, db ened oar 


having the following properties. For any two vectors z,y € R%, where (z,z) = 
(y, y) = 1, the following equation holds: 


2,y)) = FO), LO). 


In particular, this implies that for any vectors 21,...,2) € R®? with (2x;,2;) = 1, 
ee bs 


> QO (a4, 25)) = Sy (8) (ei), FOG; n= (SA (7) 2 fiO(a)) > 0. 


ij=l1 Uj=l 


Consider now an arbitrary function 


t) = > a.Q (t), 
k 


where Ay > 0 for k = 1,2,..., and A(t) < 0 for -—1 < t < s. If the vectors 
X1,-..,£~N € R?¢ satisfy the conditions for the constant Na(s), then 
N N N 
NA(1) = 9- A((ai, 2) > D5 A((wi,21)) +2 Y5  A((xi,2;)) = D5 h(a, 23)) = 
i=l 71 1<icj<N ‘jai 
(d) = (@) a (d) 
=o yy MQe (Ga) = > Ae >, Oe (et) S06.) Oa (Gy 2) = DoN-, 
j=l ik k tj=l1 j=l 
so that h(1) 
N < —— 
= 


By constructing a suitable function h we can thus obtain an upper bound for the 
constant, Na(s). 
If, for d = 8, we consider the function 


h(t) = (+1) (48) 2 = 


(3) |. 2 Car (8) 208 7)(8) 808 77(8) 320 (8) 512 7(8) 
= 75 (40% + 7Q) + BBQs” + B8.Q2 + BQ” + BR8"), 


then we immediately obtain 
1 h(1)  h(1)960 
Ng {=} < —— = ——— = 240. 
; (5) aan 7 9 
If, for d = 24, we take for h the function 


Hor (Coy aed, 
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then we obtain by similar transformations: 


1 
Noa @ < 196 560. 


RR ACs 
306-380 5346-— ot 215 
500-595 7398-17 877 

12 582-915 10 668-25 901 
24-25 840-1416 17 400-37 974 
40-46 1130-2233 27 720-56 852 
72-82 1582-3492 49 896-86 537 
126-140 2564-5431 93 150-128 096 
240 4320-8313 196 560 


ONDA aohwnwe |i 


Table 4.1 
There are known (already for thirty years) concrete placements of spheres realizing 
these bounds, and giving precise values of contact numbers in euclidean spaces of 
the indicated dimensions. The exact values and best bounds for contact numbers 
have been given in Table 4.1. 


3. Some applications of geometric constants 


It is remarkable that in its original form, Mantel’s theorem was formulated and 
proved not in terms of graphs, but for vectors in euclidean space. An essential step 
in unifying extremal geometric problems and problems concerning graphs was made 
by P. Turan, who noted that among n vectors in a euclidean space there must be 
‘many’ pairs of such vectors with ‘long’ sums. A sufficiently general revelation of 
this connection is given by 


THEOREM 4.5. Letn > k >1 > 1 be natural numbers, and let X be a normed 


linear space. Then there are in each system on = {%1,---,%n: ||xil| > 1, 2 = 
1,...,n} C X at least T(n,k,l) subsystems o; C On such that 
llou|| = 6(1,k; X). 


PROOF. Taking the system o, C X as vertex set, we construct an /-homogeneous 
hypergraph G'(on) by the rule: 


a1 € G'(on) © |loul]| > 6(1, k; X), 
and we verify that this hypergraph has at least T(n,k,l) hyperedges. Assume the 
contrary; then 
Ao; C On: Vor Co: |loul| < 6(l, kX), 
or 
AG 0K max llou|| < 6(2,k; X). 
ocoh 


But then 


min max ||o;|| < i Meu < 6(l,k; X), 
o,CX o1Co, 
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so that 


min max ||| < 6(/, k; X), 


contradicting the definition of 6(k,/;X). O 


A probabilistic meaning of the relation between geometric and combinatorial ex- 
tremal constants was given by G. Katona. Its essence is that if among n vectors in 
a euclidean space there are ‘many’ pairs of vectors with ‘long’ sums, then by ran- 
domly choosing pairs of vectors, the probability that a pair has ‘long’ sum is large. 
An explicit probabilistic relation between combinatorial and geometric constants is 
demonstrated by 


COROLLARY 4.1. Letn > k > 1 > 1 be natural numbers, X a normed linear 
space, and €,...,& independent, identically distributed random vectors in X. Let 
T(n,k,l) be Turan’s number and 6(l,k;X) the geometric constant defined in §4.2. 
Then the following inequality holds: 


l 

Pt ly 6 
t=] 

For instance, if X is an arbitrary linear normed space, k = 3 and / = 2, then 


2 Pcs 
P{le+nl > sx} > sP(\ell > 2}; (4.19) 


however, if X = A, then 


> 26(I, ; xy} > (i aod PIE > x}. 


Pile +n] > 2} > SPA > 2}. (4.20) 


We give an application of (4.20) to the Bertrand paradox: what is the probability 
that. the length of a ‘random chord’ in the unit disk is larger than V3? The essence 
of the Bertrand paradox is that there are several natural values for this probability, 
in particular 1/2, 1/3, 1/4 (see [207]). 

On the other hand, for independent, identically distributed random points in the 
unit disk, 


P{lE— nl < V3} = PIE +n] <1} =1—-P{l€ +n] 2 1} < (4.20) < 
er 4 
<1-5P%{lel> 1} =1-5=5. 
Therefore, if both ends of the chord are independent and identically distributed, 
then the maximum of the probability considered in the Bertrand paradox is 1/2. 

More generally, if S is a bounded subset of R” and €,7 are independent and 
identically distributed, then the maximum probability of the event |€ — y| > a is 
equal to (k — 1)/k, where k is the maximum number of points 2),...,2% € S such 
that |x; —2;| >a,i4 j. 

We now note some applications of geometric constants to matrices. We consider 
square matrices of order n. The classical notions in matrix algebra are: matrix 
norm, generalized matrix norm, spectral norm, and numerical radius. 

A generalized matrix norm is a numerical function N on matrices satisfying the 
following conditions 


NCA) 20; 
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N(A) > 0if A #0; 

N(q@A) = |a|N(A) for a € C; 

N(A +B) < N(A) + N(B). 
If, moreover, N satisfies N(AB) < N(A)N(B), then it is called a matrix norm. An 
example of a matrix norm is the spectral norm 


|| Allo = max{a? A? Ax: x € C”, ||x7|| = |lz|| = 1}. 


An example of a generalized matrix norm (which is not a matrix norm) is the 
numerical radius 


r(A) = max{|x? Az| : x € C”, ||z7|| = ||z|| = 1}. 


Consider a matrix C such that C # AI and TrC # 0 (here, TrC denotes the 
trace of the matrix C’). We define a generalized matrix norm ro by 


ro(A) = max{| Tr(CUTAU)| : U? = U = I}. 


If C = diag{1,0,...,0}, then rc(A) is the numerical radius of A. 

For any generalized matrix norm N there exists a number v(N), the smallest 
multiplier v for which vN is a matrix norm (if v(N) < 1, then N is already a matrix 
norm). A matrix with real entries is called Hermitean if all its eigenvalues are real.! 


THEOREM 4.6. Let C be a Hermitean matriz and let W = (w,...,Wn) be the 
vector of eigenvalues of C' (i.e. w; is an eigenvalue of C’). If 6\(W;R) # 0, then 


Alw, +--+ + Wpl 
y(rc) _— ~§&(WeR) 


To prove this theorem we need several lemmas. 


LEMMA 4.1. Let N be a generalized matriz norm, M a matrix norm, andb> a> 
0 constants such that for any matrix A, 


aM(A) < N(A) < bM(A). 
Then v(N) < ba~?. 
PROOF. Put N’(A) = ba~?N(A). Then 
N’(AB) = ba~?N(AB) < ba~*M(AB) < ba~?M(A)M(B) < 
< ba*N(A)N(B) = N'(A)N'(B). Oo 


1 Recall that a number 2; is an eigenvalue of a square matrix A if and only if 


det(z;I — A) = 0. 
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LEMMA 4.2. Let A and C be normal matrices? with eigenvalues 21,..-,2n and 
W1,--.,Wn, respectively. Then 


n 
ro(A) = me 2 wats 


’ 


ro(A) || Alle. 


n 
<|\d wy 
jel 


PROOF OF THE THEOREM. Let H be a Hermitean matrix with eigenvalues 
Y1,++.5Yn- Then 


| lz = max ly), 
and, by Lemma 4.2, 


H)= 
ro(H) max 


Ds Wag) U5 
j=l 


It can be easily seen that rc(A) = rc(A’). Put H; = A+ A’, Hy = i(A— A’). 
Then A = (H, — iH2)/2, and the matrices H,, H2 are Hermitean. Since rc is a 
generalized matrix norm, we have 


rc(A + A’) = 2rc(A), ro(iA = iA’) = 2rc(A). 


> 6(W;R) max ly;| = 6(W; R)|| Ale. 


Therefore 
H Ro (it H H. 
ata) rol vt c(H) > 6(W:R) [Hille + Helle S 
H H. A 
5S sw. Ry Mil =i ale _ gwv;nylle 
Whence 
1h « an 
6(W;R) ms < [So wy] IIA lle; 
j=l 
and by applying Lemma 4.1 we arrive at the required bound. 0 
COROLLARY 4.2. Let C be a Hermitean matrix and w; > we > ++: > Wry its 


eigenvalues, w= Wy +--+ + Wn, W F Wn, w #0. Then 


A(|w — w,| + |w — wr) 
|w|(w — Wp)? 


v(rc) < 


As an example we note that for the classical numerical radius (the case C = 
{1,0,...,0}) this corollary gives a sharp bound, since v(r) = 4. 


COROLLARY 4.3. Under the conditions of theorem 4.6, 


6(W) = |w(Wmar — Wmin)| > 2\w|°* 
|w — Wael + |\w— Wminl 


where Tro 18 a matrix norm. 


? A square matrix A with real entries is normal if AAT = AT A. 
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A square matrix of order n and with entries 1 and —1 is called a Hadamard matriz 
if it satisfies AA’ = nI. The basic problem concerning Hadamard matrices reduces 
to the question of existence: For which n is there a Hadamard matrix of order n? It 
is not difficult to verify that Hadamard matrices can exist only if n = 1,2,4k. The 
following result, which we state without proof, reduces this problem to the problem 
of computing an extremal geometric constant. 


THEOREM 4.7. The following assertions are equivalent: 
1) 6(2,4n-—1; 471) = #2. 


4n—1? 
2) 6(2,4n; @j""") = 4; 
3) there exists a Hadamard matrix of order 4n. 


4. Problems and assertions 


PROBLEM 4.1. Does there exist a three-dimensional normed linear space with 
contact number of its unit sphere exceeding 26? 


PROBLEM 4.2. If the weight vector is not balanced, then for all X, 
6(Wim, ™; X) = |w]. 


PROBLEM 4.3. If the weight vector is of definite sign, and |w,| >--- > |w,| > 0, 


m>n, then 
WW 


ark) OW alee ea 


PROBLEM 4.4. If the weight vector is balanced and of varying sign, then 


" 2w — Ww, — Wr 


PROBLEM 4.5. If the weight vector is balanced and w; = —wp, then 


inf 6(Wr,m, A SOW eI Ca) = Why 


and, moreover, a system of m basis vectors in &.. is an extremal construct. 


PROBLEM 4.6. If X, is a simplex inscribed in the unit sphere of R*~! and wy,..., wi 
are real numbers, then for each o; = {x1,..-,21} C Up we have 
0.5 
: _ (kXj-)u? -w?\ 
a aay = em 
i=] — 


in particular, the length of an edge of ©, equals (2k/(k — 1))°°. 


PROBLEM 4.7. If dimH > n—1, En, = (q),.--,€n), & € {+1,—l}, @ > n, then 


2... 9 0.5 
6(Ey: H) = (" : 
= 


1 


If, however, dim H > 1, & < n, then 6(F,; H) = |el. 
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PROBLEM 4.8. If dimH >n—1, En = (41,---,€n), & € {+1,—-l}, then 


ile n=, 3, 
n2—]n05 3 Oe es 
sup 6(F,; H) = a ’ n—|n°[ even, n# 1, 
Br 0.5 
n2—(]n0-5 2 
ae , n— |n°*{ odd, n¥3. 


PROBLEM 4.9. Define the constant 6(2,k),...,k:; X) as follows: 
6(2,ky,...,ki;X) = inf, max min |lo9|l, 


on CX 920k, 


where the maximum is over all partitions of the system of vectors ox, as a multiset, 
into blocks of prescribed dimensions: 


t 
Ok = See, 
i=] 


and the minimum is over only those blocks that have volume at least two. Then 
every system on = {X1,---,2%n: ||zi|] > 1,2 =1,...,n} C X contains at least 


m ( 3 ks.) 
i=1 

subsystems o2 C op such that |loe|| > 6(2,k),...,k:;X). For the definition of 
m(n; H;,) see Chapter 3. 

PROBLEM 4.10. Try to compute or estimate the value of 6(2,k,..., k,;_X). 

PROBLEM 4.11. A system of unit vectors 0, is called an l-system if 

Vop C On oq C Op Vox CO—Q: |lox|| = 1 ER. 
Compute or estimate the constant 
A-(n, p,q, k; X) = sup |lon|l, 


where the supremum is over all [-systems on. 
Hint. Use the results in Chapter 3 concerning locally Turan hypergraphs. 


PROBLEM 4.12. Suppose six points in the plane satisfy the inequalities |a;—a;| < 
1(1<21< Jj <6). Prove that there are points ax, a1,@m among these six points such 
that |az — ai| <1, |jak — @m| < 1, |a: — am| < 1. 

Hint. Use Ramsey’s theorem. 


CHAPTER 5 


Application of the results of solving extremal combinatorial problems 


The efficiency of the functioning of automatic control systems is directly related to 
an increase in the scientific-technological level of designing them. However, up till 
now the solution of a number of problems in the technical design of automatic control 
systems is mainly based on attempts by and intuition of the developers. Such an 
empirical approach can lead already in various stages of development of automatic 
control systems to errors in design solutions, the removal of the majority of which can 
only be managed at a very high price. This relates in full measure to the solution 
of the problem of choosing the necessary size of operation computer memory, to 
the solution of the problem of optimal splitting of programs and variables, and to 
a number of other problems arising in the process of designing automatic control 
systems. 

The main instrument of study in the process of designing automatic control sys- 
tems is imitation modeling. However, its use comes to a long path, and requires 
a large amount of time [96]. On the one hand, using this method for designing 
automatic control systems ensures the necessary precision in estimates of values of 
the parameters investigated, at each stage of the design, and on the other hand it 
practically leads to a halt at the stage of determining the parameters of a number 
of technical tools from the general flow of development. For example, nowadays the 
final choice of the necessary size of operation computer memory is successful only 
at the stage of test run of the automatic control system [40]. 

A new theoretical approach to the solution of the problem of designing an au- 
tomatic control system is the use of methods from combinatorial analysis, and, in 
particular, from the theme of extremal combinatorial problems concerning number 
partitions. The high level of abstraction of the formulation and solution of extremal 
combinatorial problems allows us to use them in the design of both hardware and 
software of automatic control systems. Combinatorial methods of study assume a 
formalization of the various elements of a system by combinatorial objects. The 
collection of these objects forms combinatorial models, which make possible, on the 
basis of a priori information on the functioning of the elements in the system, the 
description of the whole set of their states. The use of the results of solving extremal 
combinatorial problems in the process of investigation essentially shortens the nec- 
essary amount of system states to be analyzed, and allows us to give a comparative 
analysis of indices of functioning, both using exact values of them and using (upper 
or lower) bounds of their values. 
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The principal aim of the examples of using the theme of extremal combinatorial 
problems concerning the set of number partitions lies in that the reader develops 
methodical experience on the formalization of the processes investigated here, using 
the notion of packability of partitions and extremal results for solving practical 
problems. Therefore, in the statements of the specific practical problems we do 
not give precise definitions of the processes investigated, and do not describe their 
cause-effect relationships with the process of functioning of an automatic control 
system in the large. 


1. Combinatorial models for studying the process of computer memory 
allocation of an automatic control system 


Studies related to improving the efficiency of methods of controlling computer 
memory allocation are mainly directed at the search for efficient methods for alloca- 
tion, re-allocation, and reorganization of memory. Allocation of memory is a finite 
sequence of maps (J — F),; (¢ = 1,...) from a set J of information objects (pro- 
grams, array data) or their names into the set F’ of physical addresses of the memory 
to be allocated at discrete time moments t¢ of functioning of the automatic control 
system. Re-allocation of computer memory is the transfer of a number of infor- 
mation objects from the address space in operation memory into auxiliary memory 
with the aim of emptying operation memory and distributing in it other information 
objects necessary for continuing the computing process. A reorganization of mem- 
ory is a displacement of information objects in address space. Re-allocation and 
reorganization of memory are one of the basic methods for improving the efficiency 
of using this costly computing resource of modern computers. 

There are two methods of allocating memory: statical and dynamical. A memory 
allocation is called static if (J — F); is chosen once and for all for executing the 
program of problem. In dynamic memory allocation each (J — F’); is chosen in the 
course of the computing process immediately at time t, starting from (J > F’);_1. 
The application of one or the other method depends on the presence of information: 


e on memory usage; 
e on properties of program referencing or the sequential use of information. 


Static allocation can only be applied if information on memory usage and pro- 
gram referencing are available before solution by the program. The use of dynamic 
memory allocation assumes that information on usage is not known in advance, and 
that the property of referencing is determined during the execution of the program 
only. It is this method of functioning that is present in real-time automatic con- 
trol systems. In such systems the requirements on operation memory are, in each 
concrete instance, determined by the nature and intensity of the flow of data for 
processing information, which, in turn, is random. In [14] it has been noted that 
efficient functioning of a real-time automatic control system can only be achieved 
when for the satisfaction of a demand to distinguish operation memory fewest possi- 
ble restrictions are imposed, while emptying of occupied memory sectors is done as 
fast as possible. These thoughts speak in favor of dynamic allocation of operation 
computer memory. 

Studies related to estimating the efficiency of applying various methods for con- 
trolling memory allocation pursuit mainly the following unique goals: 
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e free the programmer from worries about memory allocation; 
e improve the efficiency of memory usage; 
e minimize the expense of processing time in controlling memory allocation. 


In realizing both the static and the dynamic means of memory allocation, one of 
the main obstacles on the way of an efficient use of them is memory fragmentation 
[41], [95]. In studies of the appearance of memory fragmentation we can distin- 
guish two directions: stochastic, when the influence of fragmentation is considered 
as a random process, and deformational, when the process of functioning of the 
system leads to given, designed states of fragmented memory. The first of these ap- 
proaches is related to studies of the process of memory allocation in computers with 
segmented organization of programs and data, the second to studies of stackwise 
memory organization or to memory allocation by size-restricted free sectors. 

The loss in efficiency when using memory with segmented organization of pro- 
grams and data is conditioned by the influence of ezternal fragmentation or ‘scat- 
tering’ of memory at practically every time instant over the large amount of free 
and occupied sectors of various lengths. External fragmentation arises because of 
the random nature of the flow of inquiries on memory isolation, of the various sizes 
of these inquiries, which in the address memory space are placed like words, and 
also because of the random sojour time of programs and data in computer memory. 

‘Scattering’ of memory in the process of functioning of an automatic control sys- 
tem quite often leads to situations when in memory there is no free continuous 
sector of address space necessary to fulfill an entering memory inquiry. In this case, 
even if the total size of all available free fragments is equal or greater than the 
size of the part of free memory required, the entering inquiry cannot be satisfied 
without applying reorganization or re-allocation of memory. The very application 
of reorganization or re-allocation of memory requires an additional expense of the 
processing time in controlling memory allocation, which results in a reduction of 
the productivity of the automatic control system in the large. The loss in efficiency 
of memory usage when it is stackwise organized is conditioned by the influence of 
internal fragmentation. Internal fragmentation arises because of rounding the size 
of each entering memory inquiry to an integer number of stacks. This additionally 
distinguished sector of memory is not used in the process of executing the program, 
and also determines a loss in the efficient use of memory in the large. 

Stackwise memory organization essentially simplifies the solution of the problem of 
memory allocation, since the size of every stack is the same, and for every concretely 
chosen stack we may take any other stack. However, studies indicate that in the 
process of functioning of an automatic control system the loss in efficiency of memory 
usage conditioned by the influence of internal fragmentation turns out to be larger 
than the loss brought about by the influence of external fragmentation [41], [173]. 
Hence, by reducing the expenses in processing time for controlling memory allocation 
for segmented organization of programs and data, this can also greatly increase the 
efficiency of such a mechanism of memory control. Therefore our investigations will 
be directed to the study of processes of memory allocation of computing systems with 
segmented organization of programs and data, which have the following advantages 
in comparison with systems with stackwise memory organization: 


e the solution of the problem of organizing exterior referencing to segments is 
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greatly simplified, since in this case the unified programs are not required to 
work with absolute addresses; 

e the control is made easier by the solution of re-entering programs; 

e loss in efficiency in memory usage because of rounding the sizes of inquiries 
to the stack size adopted in the system (loss by internal fragmentation) is 
excluded. 


We consider several general combinatorial models allowing the study of the process 
of allocation of operation computer memory when there is segmented organization 
of programs and data. 


MODEL 5.1. At any time instant of functioning of the automatic control system, 
the influence of external fragmentation on the process of memory allocation can be 
sufficiently well characterized by the following parameters: 


e the amount of free (occupied) sectors of memory; 
e the size of the free (occupied) sectors; 
e the total size of free (occupied) memory. 


In these studies, inquiries on distinguishing memory are, at any time instant of 
functioning of the automatic control system, sufficiently well determined by the 
following parameters: 


e the amount of inquiries on distinguishing memory in queue; 

e the required sizes of continuous sectors of address memory space, or the sizes 
of the inquiries; 

e the total size of memory required to satisfy the inquiries from the queue. 


Let Q be the size of operation computer memory in an automatic control system, 
and N the total size of free memory, which takes values N € Zt, N < Q, in the 
course of functioning of the system, where Zt is the set of nonnegative integers. 
Because of the influence of external fragmentation of memory, n is ‘scattered’ into 
r free fragments, represented by continuous sectors of address memory space. Such 
a state of free memory may be interpreted as a vector 


ZUN. "iy cng Tie), Ne Yo rig, Ny 2 Ng SF +++ 2S Ny, 


n; € Zt, where n; is the size of the ith free sector of memory and r is the amount 
of such sectors. 


DEFINITION 5.1. Two states of free memory z(N) = (n1,n2,...,n,r) and z'(N) = 
(n},5,...,m,) are regarded as distinct if they are different as vectors, i.e. if there 
is an 7 such that n; £ ni. 


We can similarly interpret each state of occupied memory by a vector g(D) = 
(d,,d2,...,d:), where d; is the size of the ith continuous sector of address space 
occupied by memory and D is the total size of occupied memory. Two states of 
occupied memory are regarded as distinct if they are different as vectors. 

In modeling inquiries for distinguishing free memory from z(NV) = (nj, Nne,..-, Mr 
it is assumed that these can enter either simultaneously, i.e. in batches g(K) = 
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(ki, ko,..-,k¢), or alone, where k; is the required size of free memory at the jthe 
inquiry. We interpret a batch of inquiries also as a vector, i.e. 
t 
Gh ) = (hy; Royse ke), kek, kj >ko>--->k, 
j=l 
k; € Z*. Two batches of inquiries are assumed to be distinct if they are different as 
vectors. 

The components of the vectors z(V), g(D), q(K) (i.e. ni, dm, k;) are natural num- 
bers. Hence, z(NV), g(D), and q(K) may be interpreted as partitions of the numbers 
N, D, and K, respectively, ie. p(N) = (m,1e,...,nr), p(D) = (dy, do,...,dz), 
p(K) = (ky, ke,..., ke), where the sectors n; are defined as the sizes of free sectors of 
address memory space, the sectors k; are the required sizes of continuous sectors of 
free memory in address space or the sizes of memory inquiries, and the sectors dn 
are the sizes of continuous sectors of address space occupied by memory. The ranks 
of the partitions p(V), p(K), and p(D) are defined as, respectively: r, the number 
of continuous sectors of free memory in address space; t, the amount of inquiries 
in the queue; and /, the number of continuous sectors of address space occupied by 
memory. 

The interpretation of states of free and occupied memory by unordered number 
partitions allows us to adequately model external memory fragmentation without 
taking into account states of its address space, which essentially simplifies the in- 
vestigation of the process of memory fragmentation. In fact, in our subsequent 
investigations we will be interested in answering the question: Is there a free contin- 
uous sector of address memory space that suffices for fulfilling the entering memory 
inquiries? In this problem statement we do not need data on states of the address 
space of free memory. 

The representation of batches of inquiries on distinguishing memory by unordered 
number partitions also does not contradict the practical meaning of the process 
under investigation. If inquiries on memory goes in batches, then they all must 
be satisfied simultaneously, and the algorithm of allocating the inquiries can be 
arbitrary, as can be the order or sequence of distinguishing free memory sectors for 
them. Hence, the model 5.1 is an adequate representation of both the states of 
fragmented memory and the system of inquiries on distinguishing memory that can 
be formed during the process of functioning of the automatic control system. 

Using the set of number partitions we can describe the set of all possible states of 
fragmented free memory of fixed size. As already noted, in the process of function- 
ing of the automatic control system the total size of free computer memory varies 
between 0 < N < Q, where Q is the size of computer memory. Using the following 
property of the set of number partitions: 


P(Ny) 0 P(N2)N--+N P(N,) = 0, 


where P(N;) is the set of partitions of the number Ni, and N; #4 N; Vi # 7, we can 
show that the set of states of fragmented free computer memory of size Q is defined 
as the set of number partitions 


Q 
Z(Q) = U Z(N)=U UP), 


N=0 r=] 
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or (using cardinalities of sets) 


Q NA(Q+I-N) 
|Z(Q)| = y |Z(N)| = Dy d IPN )I, 
where P(N) is the set of number partitions of rank r. The truth of these equalities 
is confirmed in 


THEOREM 5.1. A partition p(N) € P(N) corresponds to a state of free computer 
memory of size Q if and only if N + r(p) — 1 < Q, where r(p) is the rank of the 
partition p(.N). 


PROOF. Necessity. Let p,() be a partition of a number N of rank r which 
corresponds to a state of free memory of size N. By the definition of fragmentation, 
between every two n; and n,;;, there is a segment d; of occupied memory. Suppose 
the minimal size of the segments is equal to one (d; > 1). Then, clearly, the number 
of occupied segments will be 1 > r(p) — 1. Hence the total size of occupied memory 
F’ > r(p)—1 =r(p—1). But since r(p) is the rank of p(N) € P(N) with0< N <Q, 
we have F > Q-—N. 

Sufficiency. Let N + r(p) — 1 < Q. Then the size of occupied memory is F’ = 
Q—-—N > r(p)—1. This means that we can find segments of occupied memory which 
fill all r(p) — 1 places between segments of free memory by these r(p) — 1 ones. This 
we need to prove. Then 


Q Q Q 
SIZNI= >> SS x~M= do YS x(N 4r(p) -1) <Q), 
N=0O 


N=0 pe P(N) N=0 p€P(N) 
where y is the indicator function, taking values as follows: 


y= 1 if p correponds to a state of free memory, 
aN a 0 if p does not correpond to a state of free memory. 


It is well known that |P(N)| = SS, |P-(V)|, hence 


512) |= SSW +r) -) <Q) 2 = 


N=0 k=1 pE€P(N) 


Q N 
= SE x(N + rl) - 1) $ QIPN)| = 


N=0r=1 
Q min(N,Q+1-—N) 


(rp) S<(Q+1—-N))|FAN) =>) Dd) IBN). 


N=0r=1 N=0O r=] 
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Thus, the number of states of free memory of size Q is defined by 


min(N,Q+1-—N) 


> |Z(N)| = > 2, |P.(N)]. Dp 


Studies of the process of memory allocation of automatic control systems also 
include the process of satisfying inquiries in address space of free computer memory. 
A formalization of this process is as follows. 
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MODEL 5.2. A specific feature of memory allocation in computers with segmented 
organization of programs and data is the indivisibility of entering inquiries on distin- 
guishing memory, i.e. to satisfy each inquiry one needs a continuous sector of address 
memory space of varying length. This kind of organization of memory allocation 
is applied in real telecommunication systems of KAMA type, in the allocation of 
operation memory of multiprocessor computing complexes of ELBRUS type, and in 
many other systems. Satisfaction of an arbitrary inquiry on memory is here real- 
ized by the successive execution of two processes: the process of searching for free 
continuous sectors of address memory space equal to or exceeding the size of the 
inquiry, and the process of assigning this free sector to the inquiry. 

Both processes may be realized by various algorithms; however, in the course of 
executing them the found sector of free memory is either completely excluded from 
the list of free sectors (if the sizes of the sector and the inquiry coincide), or in the 
list of free inemory the remainder of free memory, the difference between the sizes 
of the sector and the inquiry, is taken into account (if a free sector of memory of 
size larger than that of the inquiry is chosen). As a consequence we can show that 
more than one inquiry on memory can be satisfied by a single sector of free memory. 
This means that processes of satisfying inquiries on memory can be modeled by the 
notion of packability of number partitions. In fact, suppose that, in correspondence 
with the model 5.1, the inquiries, interpreted as partitions (k),...,k,) + k&, must 
necessarily be satisfied in memory, for which the free sectors correspond to parts of 
the partition (n),...,nr7) nandk <n. 

In correspondence with the definition of packability, the partition (k,,...,k,) can 
be packed into the partition (n,,...,n,) ifa part k; of (k,,...,k_) can be grouped into 
r batches (each part k; goes into a single batch, and empty batches are discarded) 
such that after addition of all parts k; in each batch we obtain r numbers p; < nj 
it = 1,...,r. Moreover, in a concrete packing process each n; in (n,...,Mr) is used 
at most once, i.e. the fragment of size n;, in which a batch of inquiries occupies 
size pj <7;, is not used anymore for distributing inquiries k;, even if n; — p; > 0. 
Consequently, the notion of packability of partitions is an adequate interpretation 
of the process of satisfying inquiries on free computer memory. 


As an illustration we consider a concrete numerical model. 


EXAMPLE 5.1. Suppose the batch of inquiries on memory consists of sizes (5, 2, 1), 
while the system of sectors of free memory consists of sizes (6,3,3). Then the 
inquiries can be simultaneously satisfied, and, moreover, in a nonunique manner: 
(6 contains 5 and 1, 3 contains 2, 3 contains 0), (6 contains 5, 3 contains 2 and 1, 
3 contains 0), (6 contains 5, 3 contains 2, 3 contains 1). This manner of grouping 
inquiries for distributing them in sectors of free memory precisely reflects the actual 
work of algorithms of dynamic memory allocation. 


Using this terminology, we consider the application of results of solving extremal 
combinatorial problems in the design of methods for controlling computer memory 
allocation. 
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2. Design of algorithms for controlling computer memory allocation 


There are various algorithms used to stave off the influence of external fragmen- 
tation of computer memory. However, in the realization of any such algorithm, in 
order to identify failure in satisfying inquiries one consults the whole list of free 
memory sectors. Moreover, this list is consulted for each inquiry separately. This 
consultation of the list of free memory involves an expense of computing resources 
of the central processor (processor time). If as the result of the consultation a nec- 
essary part of free memory is not found, the entering inquiry cannot be satisfied in 
this case, and the processor time used in the consultation turns out to have been 
used in vain. This algorithm is used in practice in all Soviet, and in the majority 
of foreign, computers. The great intensity of the flow of inquiries on distinguishing 
memory in the process of functioning of a computer, and also the high frequency 
of the memory situations considered above, reduce the productivity of computing 
systems in the large. For example, in correspondence with estimates in [90], the loss 
incurred by control of memory makes up around 12% of the overall productivity of 
a system working in a time-sharing regime. 

For computing systems in whose process of functioning there does not arise a 
queue of inquiries on distinguishing free memory, one may exclude the nonuseful 
expense of processing time for consulting the list of free memory by comparing the 
sizes of entering inquiries with the size of the maximal sector of free memory (this 
quantity must be stored in the system and must be dynamically corrected in the 
process of functioning). 

However, in the functioning of simultaneous multiprocessor computing complexes 
queues on distinguishing memory do arise. They are formed because of conflicts 
when addressing common data (the list of free memory, etc.), and also in the re- 
alization of mechanisms of re-allocation and reorganization of memory [48]. Such 
queues (batches of inquiries) can serve as a source of a priori information on the 
nature of the flow of inquiries on distinguishing memory. This makes it possible to 
increase the efficiency of the use of computer memory on account of the possibility of 
a more rational planning of distributing information objects in address space of free 
memory, and allows a reduction of expense in processor time for controlling memory 
allocation on account of a reduction of the number of consults of the list of free 
memory. Nevertheless, in the design of modern algorithms for memory allocation, 
queues of inquiries are not taken into account. 

A partial solution of the problem of designing algorithms for memory allocation, 
taking into account possible formation of queues of inquiries, is made possible by 
the results of solving extremal combinatorial problems on packability of number 
partitions (Theorem 2.1). In terms of the models 5.1 and 5.2, the main problem 
in satisfying batches of inquiries consists in establishing the possibility of packing 
the partition (k,,...,k,), interpreted as the sizes of the inquiries in the batch, into 
the partition (n),...,n,), interpreted as the sizes of fragments of free memory. Ac- 
cording to Theorem 2.1, the partition (k,,...,k;) can be packed in the partition 
(Tig; dae tip re 


t > max (k—|2/+1,1), 


where ¢ is the number of inquiries in the batch and k = t_, ki, n = jai Ny. 
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In the formal statement of this problem, finding the quantity max(k—|n/r[+1, 1) 
means that when determining the possibility of satisfying each inquiry of a batch 
with sizes (k,,...,k,) in fragmented address space of free memory it is not required 
to consult ¢ times the list of free memory. For this it suffices to preserve in the 
computer only data regarding the total size and amount of parts of free memory, 
and also data regarding the amount of inquiries in a batch and their total size. 
Precisely the use of these data in determining the possibility of satisfying entering 
batches of inquiries ensures a complete absence of expense in processor time for 
nonuseful consultation of the list of free memory, since if the packability conditions 
are fulfilled, then the result of consulting the list of free memory always gives free 
sectors of address memory space which can satisfy each inquiry in the queue. 

However, determining max(k — |n/r[ + 1,1) is not the final answer to the problem 
posed. Every new result related to the solution of extremal combinatorial problems 
on the set of number partitions, partially ordered by packability, always gives also a 
solution of the problem of designing an algorithm realizing this packing. Therefore it 
is now necessary to determine or construct an algorithm that, when the packability 
conditions hold, would guarantee a complete allocation of the inquiries in the batch 
in computer memory (would guarantee packability of the parts of the partitions). 
By definition, a packing of a part k; of a partition (k,,...,k,) into a partition 


(n),...,m,) transforms the partitions to the forms (k1,...,ki-1,kisi,...,k,) and 
(nq, rn ki, oe ores 

This means that after packing each part k;, the rank of (k;,...,k,) is less by 
one. For nj = kj the rank r of (n1,...,n,) is also less by one, while if n; > k; it 


is unchanged. This interpretation of packability of parts of partitions adequately 
formalizes the working of an algorithm for memory allocation. In the proof of the 
following proposition, allowing us to choose an algorithm looked for, we understand 
the procedure of packing in just this manner. 


PROPOSITION 5.1. If partitions p,(n) = (mi,...,Mr) and q@(k) = (ki,..., ke) 
satisfy the condition t > max(k — |n/r[ + 1,1), then the packing (k,...,k:) C 
(n1,...,Mr) ts guaranteed by every algorithm that distributes parts equal to one at 


the last stage. 


PROOF. It suffices to consider the case n = k. Let k; < n;. Clearly, after packing 
k; in n; the rank r does not change, and complete packability is guaranteed if k — 
|n/r| > k-—ki- |(n — ki) /r| +1, which, in turn, is equivalent to |(k — (r — 1)ki)/r[ > 
|(k + r)/r|. The last inequality clearly holds if k > |r/(r — 1)[ = 2. If ki = n;, the 
sought inequality has the form k — |k/r| >k—k,-—|(kK-—k)/(r-D[+1. O 


We assume that |(k + (r — 1)ki)/(r — 1)[ > |k/r[, but that this inequality does 
not hold if k; > 2. Thus, the proposition has been proved. 

Consequently, if the condition of Theorem 2.1 (the condition for packability of 
partitions) holds, then by Proposition 5.1, to satisfy inquiries in a group we need 
not only require them to be ordered, but also require ordering by magnitude of the 
sizes of the sectors of free memory in the list. In this case it is only necessary to 
allocate the inquiries k; = 1 at the last stage. A rule for choosing an n, to satisfy 
the inquiry of size k; can be given as follows: 


j=min{j: ki <n;}, l<j<r, 
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where r is the amount of sectors of free memory, which are, possibly, not ordered 
by magnitude. This rule is realized by the first-fit algorithm for allocating 
memory [41], which is the fastest allocation algorithm, i.e. requires for its working a 
minimum of expense in central processor time. This means that by first. verifying the 
fulfillment of the packability conditions before starting the algorithm allocating the 
memory, we can without consulting the list of free memory determine the possibility 
of satisfying an entering batch of inquiries. If these conditions are fulfilled, then the 
first-fit algorithm (with inessential reworking, in accordance with Proposition 
5.1) will completely satisfy the inquiries in the batch, without the use of some or 
other means of reorganizing memory. 

We must note that fulfillment of the packability conditions for a batch of ¢ inquiries 
with total size k indicates the presence of a reserve of free memory of volume at least 
r(k—t)—k+r, where r is the amount of sectors of free memory. Clearly, in computing 
systems that characteristically process inquiries on distinguishing memory that are 
large in size (tens or more quanta), the working according to such an algorithm 
may lead to the appearance of a large reserve of free memory. However, for systems 
in which the inquiries on memory are not large and are of the same quantum, in 
the process of their functioning a reserve of free memory will be created whose 
size will dynamically reduce in dependence of the nature of the flow of inquiries 
on distinguishing memory (the more entering inquiries, the larger the size of the 
reserve of free memory). This reserve may be used by the computer for protection 
from deadlock situations in operation memory. The quantity of this reserve may 
be reduced by using the principle of complete packability (see Chapter 2) to verify 
the packability conditions. In this case we need additional information on the sizes 
of the inquires in batches, although the solution of this problem is polynomially 
complex. 

The given application of results of solving extremal combinatorial problems is not 
the only one. It and other extremal results may be used for studying the process of 
job execution in automatic control systems, in choosing the sizes of operation and 
external computer memory, and in the analysis of specifics of software structures of 
automatic control systems. 


3. A combinatorial model for studying the process of job execution in 
an automatic control system 


The functioning of an automatic control system is made up from a set. of ran- 
dom processes and phenomena of various complexities. Their study is basic for the 
increase of the efficiency of organization of the computing process of an automatic 
control system in the large. However, many of these processes and phenomena can 
only with great difficulty be modeled by analytical methods of study. As a result the 
analytical models developed turn out to be unsuitable for even obtaining bounds on 
the values of the parameters under investigation. 

The use of methods from combinatorial analysis for studying the process of func- 
tioning of an automatic control system makes it possible to built from combinatorial 
schemes more adequate formal models of the elements, processes, and phenomena 
under investigation. The unification of these models into combinatorial schemes on 
the basis of common parameters guarantees, on the level of estimation of parameter 
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values, the possibility of an analysis of the mutual influence of parameters, as well 
as of their influence on the process of functioning of the automatic control system. 
To substantiate this thesis, we consider a combinatorial model for job execution in 
an automatic control system. 


MODEL 5.3. A job in an automatic control system is a function to be realized 
by it by executing a program or sequence of programs. If some function in an 
automatic control system is realized, in dependence on initial information, by various 
sequences of programs, then in the proposed combinatorial model such a distinction 
in realization is regarded as a distinction of functions to be realized, i.e. in the model 
it is assumed that each function to be realized by the automatic control system is 
executed by a rigidly fixed sequence of programs. This assumption does not impose 
any restriction on the generality of using the model, since in designing the automatic 
control system we always have the possibility of such a detailed representation of 
the functions to be realized by it. 

The proposed model allows us to investigate the automatic control system under 
the following restrictions on the process of using it: 


e the job serving mechanism in the automatic control system is such that for 
each terminal a subsequent job can be initialized only if the previous job 
given with the same terminal is completely executed; 

e the software structure of the automatic control system is fixed and intended 
for realizing finitely many functions processing information, and is given when 
designing the system. 


This organization of the functioning of the automatic control system is typical 
in a number of systems of a similar class, which also ensures the generality of the 
proposed model. 

Suppose the automatic control system is intended for serving f terminals and 
realizing various functions, by executable jobs z € Z, where Z is the set of all 
functions realisable by the automatic control system. Let Q; be a set of jobs, each 
of which can be initiated at the terminal 7 € [f] = {1,..., f}, where any Q; and Q; 
may intersect. We call Q; a description of the functional assignment of terminal 7. 
Taking in correspondence with the restrictions of the mechanism of job execution 
in the automatic control system, it allows us to assume that at any time instant 
the automatic control system can execute at most f jobs simultaneously. Then the 
set of distinct families of jobs that can be simultaneously executed in the automatic 
control system is defined as a direct product Q = J[{-,Q; whose elements g = 
(z1,.--,2¢) € Q are called complete families of jobs for the designed allocation of 
functions between terminals in the automatic control system and the given sets Q; 
(z; are jobs intended to be induced by terminal 2). 

In the process of functioning of the automatic control system the formation of 
any gq € Q is preceded by a set of different successive states of the automatic control 
system, characterized by the simultaneous execution of jobs. Let gq = (2,..., Zs); 
for this g we consider two states of the automatic control system, characterized by 
the execution of respectively one (z) or two simultaneous (21, 22) job(s). It is clear 
that in the process of functioning of the automatic control system the sequences of 
its states transforming the system from (z)) to (2), 22) can be very distinct. Here we 
are not interested in the number of times that the system is in a certain state, since 
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our problem is the investigation of the set of all possible states of the automatic 
control system. To conduct this investigation it suffices to fix only the possibility 
that the automatic control system is in some state. This greatly simplifies the model 
for describing the set. of all states of the automatic control system, taking (z,) and 
(21, 22) to be adjacent states if there is a transition of the automatic control system 
from (z,) to (21, 22), and using the following approach to describe the whole set of 
states of the system. 

Let Q(q) = {air a Ca}, G = (Ziy5---) 2%), (1,---,%2) © 241, where 2l7 is the 
Boolean of the set [f]. In correspondence with the definition of Boolean (see 
Chapter 1), it is clear that the elements of Q(q) are all possible pairings from 
q = (2,...,2,). Hence Q(q) can be interpreted as the set of all states of the 
automatic control system, characterized by the execution of one, two simultaneous, 
three simultaneous, etc. up to f jobs from gq € Q. However, using Q(q) we can also 
interpret all possible queues of jobs when forming a concrete full family g C Q. In 
fact, let |qi| be the amount of jobs making up q € Q(q). To form q € Q, in the 
system there must already been realized f — |qgi| such jobs, which is the comple- 
ment g; of qi in gq, ie. Gg = q \q. Using the rule for constructing Q(q), it is not 
difficult to prove that if gi = q \ q@ and q; € Q(q), then g € Q(q). So, by giving 
the bijective map ¢: Q(q) — Q(q) such that $(q;) = Gq, we obtain pairs of elements 
(qi, Gi), characterizing each state of the automatic control system by simultaneous 
execution of the jobs in qg; and the corresponding state of the queue of jobs to be 
executed when the system transfers from q; to the state characterized by the jobs 
from z € Q. Using this approach to define intermediate states for all gq € Q, we 
obtain a set W(Q) = {(w,w): w= ¢\w, w € Q(q), g € Q}., whose elements allow 
us to obtain a priori information for estimating the values of certain parameters of 
the functioning of the automatic control system. 

The cardinality of the set W(Q), or the amount of elements in it, determines 
the amount of calculations necessary for studying the whole variety of states of the 
automatic control system. Using the rule for constructing complete families gq € Q 
and the rule for constructing Q(q), it is not difficult to prove that the cardinality of 
the set W(Q) is defined by |W(Q)| = 2/ - TI, |Q:|, where 2/ is the cardinality of 
the Boolean of the set [f] = {1,..., f} and |Q:| is the cardinality of Q;. There are 
various applications of this model in studies of the functioning of automatic control 
systems. As an example we consider its use in estimating from above the necessary 
size of operation computer memory. However, for this we need consider another 
series of combinatorial models, formalizing the process of memory allocation and 
the influence of external fragmentation. 


4. Combinatorial models for estimating the necessary size of computer 
memory 


The size of the operation computer memory has an essential influence on the 
capacity of the automatic control system. If the size of operation memory is small, 
then in the process of functioning of the system part of the central processor time is 
lost on controlling memory allocation. Increase in operation memory increases the 
productivity of the system without any change in the programmatic treatment of 
data. Memory will always be a key factor in the productivity of computers. J. von 
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Neumann established this in his 1946 memoir, and it is true today [92]. 

In the design of real-time automatic control systems problems of estimating the 
size of operation memory are given attention at practically all levels of creation of 
the system. To this end complex imitation models have been created with whose 
aid one studies, basically, the behavior of the system under peak loading, i.e. in time 
periods when the average quantity of flow of jobs in the system takes a maximal 
value [14], {92]. These studies require an essential amount of computing resources 
in the development. of the system, and increase the time for creating it. However, 
such studies are necessary, since precisely under peak loadings a real-time automatic 
control system must remain running. 

We must. also note that efficient functioning of a real-time automatic control 
system is not possible without the fulfillment of a very important condition: as a 
design outcome, software should ‘correspond’ to the machine; it should be designed 
such that it does not reduce the productivity of the machine and of the system in 
the large [26], [92]. The approaches to solving the problem of controlling memory 
allocation used nowadays are such that a final solution emerges, as a rule, only at 
the stage of running the system. 

The above-given short analysis of requirements on methods for controlling the 
allocation of operation computer memory in real-time automatic control systems 
allows us to draw the following conclusions: 


e a method solving the problem of designing control of operation memory allo- 
cation must minimize the expense of computing resources, and must guaran- 
tee that a theoretically substantiated algorithm can be obtained at the stage 
of technically designing the automatic control system; 

e a method for controlling memory allocation must be designed while taking 
into account the specifics of the software structure of the system; 

e a method for controlling the operation computer memory must minimize 
the expense in processing time for allocation, and must guarantee efficient 
execution of functions of the real-time automatic control system under peak 
loadings on the system; 

e a method for controlling the operation computer memory must impose fewest 
possible restrictions in satisfying inquiries on distinguishing memory, and 
must guarantee fast emptying of segments of memory not taking part in the 
computing process. 


For the solution of these problems we propose a number of combinatorial models. 
We have chosen the sequence of presenting these models in correspondence with 
the increase in a priori information on the process of functioning of the operation 
computer memory of the automatic control system. In our models we will also 
consider the possibility of batch satisfaction of inquiries, i.e. when the inquiries on 
distinguishing free memory enter in batches. The advantage of a batch method 
for satisfying inquiries lies in the presence of additional a priori information on 
the nature of the flow of inquiries in memory, which is here taken into account by 
way of considering batches of inquiries ordered by size. We also consider a model 
of the process of allocating operation computer memory when a solitary method 
of satisfying inquiries on distinguishing memory is realized, i.e. when inquiries are 
satisfied in the order of their appearance. 
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MODEL 5.4. We consider the functioning of a computer in which inquiries on dis- 
tinguishing memory occur in batches. Let the sizes of the inquiries in a batch enter- 
ing at an arbitrary moment of time correspond to parts of a partition (k,,...,k;) Fk 
At this time moment the free memory is represented by r sectors, with total size 
n. Then by the principle of complete allocation (see Chapter 2) the computation of 
the quantities 


nk, . _ ker) = Tees (5 + (ki ~~ Dir = »| ) 
j= 


under the condition k, > --- > k;, allows us to find a total amount of free memory 
which, being represented by any partition on the r continuous free sectors in address 
space, makes it possible to completely distribute all k,,...,k; in it without re- 
allocation and reorganization. The formula of the principle of complete allocation 
implies that for solving this problem we do not need information on the sizes of 
the free sectors of memory, and thus for none of the inquiries in the batch need 
we examine the list of free computer memory. To this end it suffices to verify the 
validity of the inequality n > n(ki,...,k:3 7), where r is the amount of fragments by 
which the free memory of size n is represented. 

If the inequality is satisfied, then the proof of the principle of complete alloca- 
tion implies that for satisfying the inquiries (k,,...,k,) we can use any algorithm 
of dynamic memory allocation that uses the fact that the inquiries are ordered by 
decreasing sizes. In other words, all inquiries of the batch k,,...,k;) can be si- 
multaneously satisfied by the free memory (n1,...,n,), e.g. using the first-fit 
algorithm, if the inquiries are chosen from the queue in the order of decreasing sizes. 
Hence, the principle of complete allocation may be used in the design of methods 
for dynamic computer memory allocation in automatic control systems. 


EXAMPLE 5.2. Suppose the sizes of the inquiries are represented by parts of the 
partition (ky,...,k,) = (22,13, 12,8,4,2,2,1) + 64, while the continuous sectors 
of address space in free memory correspond to (n1,...,Mr) = (23, 21, 21, 20) F 85. 
Then 

i Rigceug her) 122,013, 1258,4,2-9, 154) = 85, 
and hence we can place all represented inquires, e.g. as follows: (23 = 22 + 1, 
21 = 13+8, 21, 20 = 12+44+2+4 2). If, however, (n1,...,m,) = (22, 21, 21, 20) F 84, 
then the principle of complete allocation does not yet imply the required packing, 
despite the fact that packing is possible. ‘This characterizes ‘zones of indeterminacy’ 
in extremal combinatorial bounds. 


MODEL 5.5. Suppose we have m batches of inquiries on distinguishing memory. 
The sizes of the inquiries in the jth batch correspond to parts of the partition 
(ke ) ao ke )) (7 = 1,...,m). In agreement with the principle of complete allocation 
it is not difficult to prove that computing the quantity 


(3) (3). 
max (ky Bees ki 1), 


under the conditions kU esos ki : makes it possible to find the total size of free 
memory every partition of which in at most r continuous address sectors guarantees 
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that. all inquiries on memory in the jth batch can be satisfied (1 < 7 < m). The 
algorithm for placing the inquiries is the same as in the model 5.4. 


MODEL 5.6. Suppose the set of possible states of occupied computer memory is 
interpreted as the set of number partitions (d®, ee ,d), a= 1,...,l, where every 
part of the ith partition d® corresponds to the size of the rth occupied sector of 
address memory space, and vu; is the amount of occupied sectors, represented by the 
parts of the zth number partition. For each ith state of occupied memory there is 
known a set of batches of inquiries, in the course of functioning of the automatic 
control system each of which requires simultaneous satisfaction of all inquiries in it 
for the zth state of occupied computer memory. 

Suppose the set. of number partitions (ko, does, ke yo corresponds to this set of 
batches of inquiries for the zth state of occupied memory; the parts of these partitions 
correspond to the sizes of the inquiries on memory, and the rank t; is the amount 
of inquiries in the jth batch (i = 1,...,l; 7 = 1,...,m;). Taking into account the 
influence of external fragmentation, we obtain that in the process of functioning of 
the automatic control system when satisfying each batch of inquiries corresponding 
to the ith state of occupied memory, the free computer memory turns out. to be 
‘scattered’ across at most v; + 1 continuous sectors of address space, where 1; is 
the nuinber of occupied sectors of memory corresponding to the ith state. Then an 
upper bound for the necessary size (V) of computer memory that, in the process of 
functioning of the automatic control system, would suffice for satisfying any entering 
batch of inquiries, taking into account the principle of complete allocation, can be 
computed by the following expression: 


V = max ( max n((ky?,..., ke); v5 +1) + ya | 


I<i<t \1<j<mi cer 


for Ro? > D> Ke, where n((k, ver KPO; vy; + 1) is defined by the formula for 


complete allocation for each partition ( G ) ihe Ske ) ), and 3*?_., d is the total size 
of occupied memory under satisfaction of the inquiries in a batch. 


This approach to choosing the size of operation computer memory of an auto- 
matic control system, as in the preceding models, assumes that it is possible to use 
algorithms of dynamic memory allocation which take into account the batches of 
inquiries and ensure satisfaction of the inquiries in these batches in the order of 
decrease of their sizes. However, nowadays the majority of computers in automatic 
control systems do not. use the batch method of satisfying inquiries on memory. ‘The 
construction of an analogous model for estimating the required size of operation 
computer memory under realization of an individual method of satisfying inquiries 
is made possible by a somewhat different formal combinatorial model of entering 
batches of inquiries in the process of functioning of the automatic control system. 
Suppose we have a batch (k,,...,k,) of inquiries on distinguishing memory, which 
must be satisfied in the order of entrance. In a finite time span, all inquiries must 
be simultaneously satisfied in free memory, which at the moment the first inquiry of 
the batch enters is represented by r continuous sectors of address space. Clearly, the 
individual satisfaction of the inquiries in the batch under consideration presupposes 


170 5. APPLICATION OF THE RESULTS OF SOLVING EXTREMAL COMBINATORIAL PROBLEMS 


that the numbers ky,...,k; are in arbitrary order. In accordance with Lemma 2.4, 
if 


fe -kir)= pa, (Soh (4D), kw EN, 
then for any sequential ordering of the numbers k,,...,k, we have the inequality 
F(R, -+-, kaj) 2 UK yxav4 KET) 


It is then clear that to estimate the required size of free computer memory 
under individual satisfaction of inquiries it is necessary to find the maximum of 
f(ki,...,k;r) on the whole set of permutations of the numbers (k),...,k,). By 
Lemma 2.4 this maximum is equal to 


t 
k+ (max ki - 1) (r—1), herr Doky 
However, this result is not the final solution to the problem. Before turning to the 
description of a model for studying the individual method of satisfaction of inquiries, 
we must formulate the <lgorithm by which we can realize the packing. The algorithm 
realizing packability for an arbitrary ordering of the parts of a partition (ky,..., ky) 
is as follows. 


PROPOSITION 5.2. Suppose we are given partitions (ki,...,k:) / k and 
(n1,...,Mr) F n for which the inequality n > k + (maxiciceki — 1) (r — 1) holds. 
Then (ky,...,k;) © (m,-..,,-), and this packing is realized by the following algo- 
rithm: 

e the parts k; for distribution are chosen in the order of their labeling; 
e each part k; 1s distributed in the first part n; that has appropriate size, 1.e. 
n; 1s chosen by the rule 


i= min(t: kj <n), 1 Say, ee 
e when choosing an n; > k;, there remains a single part nj — k; > 0. 


PROOF. We proceed by induction with respect to ¢. For t = 1 the main inequality 
takes the form n > rk; — r + 1, and the required is fulfilled by Dirichlet’s principle. 
We make the induction step. Assume that the required is fulfilled up to t — 1 
inclusive; we show that it is also true for t. The packing (ki,...,k:) © (m,...,¢) 
follows from the principle of complete allocation and the fact that n(k,,..., kr) < 
k+(maxy<icz ki — 1) (r—1) by Lemma 2.4. So, for k; we can always find an n; > ky. 
After packing the part k,, the required will follow from fulfillment of the packability 
condition for the partitions (k,,...,k:) and (m1,...,ni — ki,...,n,). To prove this 
packability we use the induction hypothesis. To this end it suffices to verify the 
truth of the inequality 


n— ky > k— hy + (soaxki—1) (1), 


where 
f for ni > ky, 
v= 


r—1  forn; = ky. 
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In fact, by requirement, 
n>k+ (roax bi - 1) (r—1)+ (max bi - 1) (v— 1), 
1<i<é 1<i<t 


and thus 
n—k >k—-—k+ (rma bi - 1) (v — 1), 
2<1<t 


as required. O 


MODEL 5.7. Let the set of possible states of occupied computer memory be rep- 
resented by the set of number partitions (d\, eagiay) Fd), 7= 1,,..,L For 
each element of this set there are m; batches of inquiries on distinguishing mem- 
ory known, represented by corresponding partitions ( (3) as Re Y@ ig eee 7 
k® being the size of the rth inquiry in the jth batch, and t,; being the amount 
of inquiries in the 7th batch. In the process of functioning of the computer of the 
automatic control system the inquiries of any jth batch enter in arbitrary order, 
and are individually satisfied in correspondence with the order of entrance and re- 
quire simultaneous distribution in memory. At the moment of entrance of the first 
inquiry of each jth batch, corresponding to the ith state of occupied memory, the 
free computer memory is represented by v; + 1 continuous sectors of address space. 
Then by Lemma 2.4 this model allows us to give an upper bound on the required 
size (V’) of operation computer memory, 


Vv 
vi= max ( max (4% + ( max, 46-1) x} +304), 
1<i<l \ l<3<m i<v<i = 


where k{') is the size of the vth inquiry on memory in the jth batch, corresponding 
to the ith state of occupied memory; ie is the amount of inquiries in the 7th batch, 
corresponding to the ith state of occupied memory; 


Kod) = 7 RED), 
v=l1 


Proposition 5.2 makes it possible to prove that when the computer of the auto- 
matic control system functions with operation memory of size V’, then satisfaction 
of any inquiry is guaranteed by the first-fit algorithm of dynamic memory allo- 
cation. 

We must note that for the computation of V and V’ in this case we do not need 
information on the sizes of the continuous sectors of free address space. This is 
in agreement with the formal interpretation of the principle of complete allocation. 
However, if at some stage of the design of the automatic control system additional 
a priori information on the functioning of the system element under investigation 
becomes available, then, using other results concerning the packability of number 
partitions, we can refine the values of the parameters under investigation. Suppose 
that the sizes of the inquiries in a batch entering at an arbitrary moment of time 
correspond to the parts of a partition (k),...,k;) Hk. At this moment of time the 
free memory is represented by r sectors with total size n, and, in distinction from 
the models 5.4—5.7, the sizes of all r sectors of free memory are known, which gives 
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corresponding parts of a partition (n,...,n-) kn. This information may be used 
to improve the extremal bounds of the required size of operation computer memory 
guaranteeing satisfaction of the inquiries on memory in batch service. In fact, if 


eae ee 


m = min(m: 33™,n; > k), and if the inequality 


max. (200 — n(ky,..., ke; 0) > 0 


is satisfied, then all inquiries on memory (k;,...,k;) / k can be simultaneously 
satisfied in the fragmented address space of free memory with sizes of free sectors 
to which correspond the parts of the partition (n;,...,n,) / n. To clarify this we 
consider an example. 


BEXAMPLE:5.3, Let t= 7 = 3, (hyj.004 hy) = (5; 2; 1) 8, as ss.) = (6,3,3) F 
12. Then n(5,2,1;3) = 13 > 12 = 64+ 3+ 3, and thus a straightforward use of 
the principle of complete allocation, without taking into account information on the 
sizes of the sectors of free memory, will not guarantee satisfaction of the inquiries. 
However, for | = 2 we find that. n(5, 2,1;2) = 9 = 6 + 3, and thus, by the principle 
of complete allocation, (5,2, 1) C (6,3). Since 


Ghiseiny ti) S CO errr) emer cre 


we have (5,2,1) C (6,3) € (6,3,3). Hence, by transitivity of packability we find 
(5,2,1) C (6, 3, 3), so that after all satisfaction of the inquiries is guaranteed. 


On the other hand, if in the clarification of the possibility of satisfying entering 
batches of inquiries on memory we take into account the sizes of the continuous 
free sectors of address space, then, using the extremal result from Theorem 2.3, we 
can also refine the upper bound on the free memory which is sufficient. to guarantee 
satisfaction of the entering batches of inquiries. We demonstrate the use of Theorem 
2.3 by constructing a number of models. 


MODEL 5.8. Consider a computer in an automatic control system in which in- 
quiries on distinguishing free memory enter in batches. To determine the possibility 
of satisfying all inquiries in a batch in fragmented address space of free memory we 
may use Theorem 2.3, whose essence lies in finding the quantities 


<i< 
1 t 1p 


Nehiss eek Nie ccs, ip) = max (4 RCT — n), 


r < >¥¢n,, under the conditions k} > --: > ky. If n(ki,..., ke; ne,...,mr) < Wye n1, 
then (k,...,k¢) C (m,...,m,r), where (ky,...,k:) / k, (ny,...,n-) - n are parti- 
tions of k and n, respectively. 

The formal source of this result becomes apparent when interpreting the number 
partitions (k,...,k:), (m,...,m,), and also packability of number partitions, in 
accordance with the definitions in models 5.1 and 5.2. To determine the possibility 
of simultaneous satisfaction of inquiries in a batch, whose sizes are interpreted as 
parts of a partition (k),...,k,) / k, in fragmented address space of free memory, 
interpreted by the partition (n,,...,n-) / n, we have to verify the truth of the 
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inequality n(k,...,ki;ne,...,mr) <n. Here the quantity n(k),...,kt,no,...,nr) 
is the minimally necessary total size of free memory which, being represented by r 


continuous sectors of address space with sizes p),...,p,, respectively, with p; < nj, 
1 = 2,...,r, guarantees the possibility of simultaneous satisfaction of the inquiries of 
sizes k,,...,k,. Hence, if for an incoming batch of inquiries on memory (kj,... , kt) 


the inequality 
r 
Lt cee rk eee | oo eS Son; 
j=1 


holds, then all these inquiries can be simultaneously satisfied in fragmented address 
space of free memory (n),...,m,) without application of the means of reorgani- 
zation or re-allocation. As in the models 5.4-5.7, for determining the possibility 
of satisfying a batch of t inquiries we need not consult the list of free memory t 
times. Use of information regarding the sizes of continuous sectors of address space 
of free memory requires in total one consultation of the list of free memory in order 
to determine whether ¢ inquiries can be simultaneously satisfied. If the inequality 
n(ky,...,kt;ne,--.,Mr) < XLF., ny holds, the guaranteed allocation of each inquiry 
k; is ensured by any imaginable algorithm for dynamic memory allocation that 
uses in its realization the fact that the inquiries are ordered by decreasing size, i.e. 
k; >--+- > k;. For example, all inquiries in a batch (k),...,k;) can be satisfied in 
fragmented address space of free memory with sizes of sectors (n),...,n,r) using the 
first-fit algorithm, if the inequality n(k),...,ki;n2,...,mr) < 51 nj holds and 
the order of choosing the inquiries from the queue corresponds to k; > --- > ky. 
In other words, if we use Theorem 2.3 as criterion to determine the possibility of 
simultaneous satisfaction of inquiries in the control of computer memory allocation, 
then, independently of the rule of choosing the free memory sectors of sizes n; over 
which the inquiries k; are to be distributed, all inquiries will be distributed if they 
are chosen from their queue in the order of decreasing size. Hence, Theorem 2.3 
can be used also to design methods of dynamic computer memory allocation in an 
automatic control system. As a clarification we consider the same example as in 
model 5.2. [n it n(5, 2,1;3,3) = 11 < 12 =6+4+3+3, and hence the required distri- 
bution can be realized. Moreover, from the determination of an extremal bound for 
n(ky,...,kp3ng,...,n,) it follows that not only a packing 


Rissa 3 Ril Ceti oat) Pt SA Rien Reg ee The) 


holds, but also the packing of (k),..., k;) into any partition (p),...,p,) / n for which 
pi <1, 1 = 2,...,r. So, in the already considered case (ki,...,k:) = (5,2,1) and 
(n},.-.-,Mr) = (6,3, 3) we have n(5, 2,1; 3,3) = 11, and hence the partition (5, 2, 1) 
can be packed into the following rank-3 partitions of the number 11: (9,1, 1), (8, 2,1), 
(7,3,1), (7, 2,2), (6,3, 2), (5,3, 3). We note that there are rank-3 partitions of 11 
into which the partition (5,2, 1) cannot be packed; clearly, (4, 4,3) is such. 


MODEL 5.9. Suppose we have m batches of inquiries for distinguishing memory. 
To the sizes of the inquiries in the 7th batch correspond the parts of the partitions 
(KY? ) sea Pepe } = 1,...,m. Suppose we are also given a state of free computer 
memory, interpreted as a partition (n),...,n,) / n, which, in accordance with some 
chosen criterion (e.g., the worst, from the point of view of distributing the inquiries 
over it), characterizes the external fragmentation in the process of functioning of the 
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computer of the automatic control system. Then, in accordance with the definitions 
in Theorem 2.3 we can prove that the quantity 


max n(k1,..., hy; ne, .--,Nr) = max ( max (>: k@ + Smee = 1) : 


l<j<m l<j<m \1<i<t; nar 
where kU ye hastS k? ) is this minimal total size of r fragments of continuous address 
space of free memory of sizes p),..., Pp With p; < n; (t = 2,...,r), respectively. And 
these sizes p; (4 = 1,...,r) guarantee the simultaneous satisfaction of all inquiries 


of any jth batch (1 < 7 < m). In this model the algorithm for distributing inquiries 
can be chosen similarly as in model 5.8. 


MODEL 5.10. Suppose the set of all possible states of free computer memory of an 
automatic control system is stp as the set of number partitions (n® .,n)), 


i= 1,...,l, where any part ni ) of the ith partition corresponds to the 7th contin- 
uous sector of address space of free memory, and uv; is the amount of continuous 
sectors of address space of free memory. For each ith state of free memory we 
know the set of batches of inquiries, each of which may require, in the course of 
functioning of the automatic control system, simultaneous satisfaction of all its in- 
quiries, given the ith state of free computer memory. Suppose that to this set of 
batches of inquiries for the ith state of free memory there corresponds a set of par- 
titions (k? Mobis ke ya), where the rank t; denotes the amount of inquiries in batch 
j(@=1,...,4; 7 = 1,...,m). Then the definitions in Theorem 2.3 imply that, in 
accordance with model 5.9, the required size of operation memory is the quantity 


max ( max n CP Gin) 250) = 


I<jsl \Isjsgm 
= (3) @) (3) 
rs, (ma a(S ky? + 3 min(n;’, k; J 


where kW? > > ke 


We may distinguish two peculiarities of the combinatorial models presented here, 
which essentially simplify the study of the process of dynamic computer memory 
allocation: 


e the interpretation of the states of free (occupied) memory by number parti- 
tions on the one hand makes it possible not to take into account a large part 
of the states of computer address space, and on the other hand this charac- 
terization of memory states includes all necessary parameters to sufficiently 
well reflect the influence of external fragmentation; 

e in the models the time that computer memory is in some or other fragmented 
state is not taken into account; the characterization using unordered number 
partitions makes it only possible to assert the fact that memory is in a state 
admissible for it. 

However, the use of these additional features in combinatorial models requires yet 
the solution of the problem of finding a priori information regarding the functioning 
of the automatic control system, and in particular regarding the functioning of the 
operation computer memory. The possibility of and methods for obtaining it have a 
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substantial influence on the efficiency of using combinatorial models in the process 
of investigation. The matter is that the looked for a priori data is input information 
for the class of combinatorial models under consideration, so that the completeness 
and degree of exactness of this data in a finite time span influence the results of the 
investigation in the large. 

The use of combinatorial models for solving the problem of finding an upper 
bound for the necessary operation computer memory in an automatic control sys- 
tem supposes, first of all, a determination of the main factors under whose influence 
a state of operation computer memory changes. The reasonings given earlier make 
it possible to say that the sizes of inquiries on distinguishing operation memory are 
one such factor. But the sizes of inquiries, in turn, depend on the sizes of the infor- 
mational objects (programs and array data) forming the software of the automatic 
control system. Consequently, to obtain a priori information on the functioning of 
operation computer memory, we must analyze the data concerning the software of 
the automatic control system. 


5. The use of combinatorial models for estimating the necessary size of 
operation computer memory in an automatic control system 


Each function to be executed in the automatic control system (information pro- 
cessing, all possible calculations, accumulation, renewal, and re-allocation of data, 
etc.) can be realized by various program complexes or software of the automatic 
control system. Using the complexity specifics of elaboration of software of the 
automatic control system, and also the aim of creating the system, related to the 
automation of a process of information processing in some or other concrete do- 
main of knowledge, we can distinguish two main properties of software of automatic 
control systems: 


e modularity of construction of software, making it possible to state in a single 
manner the requirement on the size of free memory needed for realizing some 
concrete function (task) of the automatic control system; 

e functional closedness of the construction of software of the automatic control 
system, i.e. the determination of finitely many program modules, and the 
naming of them, realizing every concrete task, and also of finitely many 
functions to be executed, which during a long time span of exploiting the 
system (between moments of updating it) remain unchanged. 


The presence of such properties makes it possible to give the following definition 
of a software structure for an automatic control system. 


DEFINITION 5.2. A software structure of an automatic control system is a descrip- 
tion of a family of functionally closed linear sequences of program and informational 
modules or informational objects, containing data regarding the maximal sizes of 
operation computer memory required for executing or loading each module and 
data regarding the successive execution (use) of these modules for each job of the 
automatic control system. 


At first glance this conception of job execution in automatic control systems may 
seem erroneous. In fact, many jobs in a system are realized, depending on the input 
data, by a ‘branched’ and not linear sequence of program modules. Moreover, each 
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program module can, in the process of its execution, be a source of inquiries on dis- 
tinguishing memory needed by it to allocate intermediate or output data. However, 
the appearance of such inquiries in a process of job execution by an automatic con- 
trol system has no influence on the nature of the sequence of executing the program 
modules for realizing a concrete function of information processing. Moreover, work- 
ing out the names of jobs in the automatic control system taking into account input 
information during a finite time span, leads to an interpretation of them precisely 
by linear sequences of program and information modules. 

The possibility of such a representation of the software structure of an automatic 
control system is of great importance in the use of the combinatorial approach to 
studies of the process of functioning of the system. Data regarding the software 
structure give already at the stage of designing the automatic control system a 
priori information necessary for applying combinatorial models and for obtaining 
bounds for the values of the studied parameters of functioning of the system. The 
degree of exactness of data regarding the software structure is determined by the 
degree of detail in the design of the automatic control system or by the design 
stage at which this information is obtained. At different design stages, when the 
algorithms implementing functions of the automatic control system have not yet 
been determined in detail, such data may serve as bounds on the necessary size of 
operation memory for implementing each function of the automatic control system. 

It is important to note that the combinatorial models 5.4-5.7 make it possible 
to obtain a theoretically substantiated upper bound for the size of operation com- 
puter memory of the automatic control system when orienting data on the necessary 
sizes of operation memory appear and in the presence of information regarding the 
functional meaning of each terminal of the automatic control system. Moreover, 
the models 5.3-5.7 make it possible to obtain analytic dependence of the bound for 
the necessary operation computer memory of the automatic control system on such 
characteristics as parameters of the software structure, the amount and functional 
meaning of terminals of the systems, etc. 

Consider the example of using combinatorial models for finding upper bounds for 
the necessary size of operation computer memory at various stages of design of the 
automatic control system, i.e. when only orienting data regarding the necessary sizes 
of memory for realizing each job from a system z € Z is available. This data may be 
given as a list of values of the sizes. Certain jobs of the automatic control system may 
be given identical orienting sizes of necessary memory in the design, hence in the list 
there may appear identical entries. This list cannot be regarded as a set anymore, 
since this would contradict the basic property of elements joined by the notion of a 
‘set’. A list having identical entries is a multiset (see Chapter 1). However, in our 
investigations we will more conveniently regard it as a number permutation. On 
the one hand, this does not contradict with the definition of partition, on the other 
hand the use of the notion of set of number partitions does not contradict the rule 
for constructing the set W(Q), expounded in model 5.3. 

Suppose the partition (k,,...,,) | & corresponds to initial data regarding mem- 
ory requirements, while its parts k; correspond to given orienting memory sizes 
necessary for realizing the zth job in the automatic control system. From the initial 
data regarding the functional meaning of each terminal of the automatic control 
system, used in model 5.3, we construct the set W(Q) 3 (w,w). We denote by |w| 
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and |w| the amount of jobs represented by w and w, respectively. By replacing each 
job in w by the necessary size of memory for realizing it, taken from (kj,..., ky) + k, 
we obtain a partition p, = p(ki,,...,ki,) of rank r= |wl,1<i<ll<r< f. 
Thus, in accordance with the definition of w in model 5.3, p, characterizes an ad- 
missible state of computer memory of the automatic control system. The parts of 
this partition correspond to the sizes of occupied sectors of memory, and the rank 
characterizes the maximally possible ‘scattering’ of free memory in the presence of 
r continuous occupied sectors of address space in it. 

In this way we also put corresponding memory sizes in place of jobs in w. A a re- 
sult we obtain a partition p, = (kj,,..-,k,,) of rank ¢ = |w| = f—r,1 <9 <1, which 
determines the maximally admissible aimber of inquiries in a batch and the sizes k;,, 
(1 < m < tt) of these inquiries. This also characterizes the possible flow of inquiries 
on distinguishing memory for the fixed state of computer memory represented by 
pr. As a result we obtain a pair of number partitions (p,, p,), corresponding to the 
element (w,w) € W(Q), characterizing an admissible situation of computer memory 
in the functioning of the automatic control system and, moreover, containing all in- 
formation necessary for estimating, in accordance with models 5.5- 5.7, the necessary 
size of operation computer memory. Putting in this way each pair (w,w) € W(Q) 
into correspondence with the pair of partitions (p,, p:), we obtain a set B(Q) of pairs 
of partitions which are necessary a priori information for giving an upper bound for 
the necessary size of operation computer memory of the automatic control system 
under investigation. 

We can distinguish two basic, and at the present design stage common, properties 
of the parameters of the partitions forming pairs (p,, pr) € B(Q); to wit: 

e the ranks of the partitions in a pair always satisfy the equality r+ t= f; 
e the sum of the numbers from which the partitions p,,p, are obtained always 
satisfies the inequality 


r t 
7 ki; < k’ fi 
dhs + 2 ki, < kf, where k max ki, 


ky © (ki, ..., 41) and ki, kj, are the parts of the partitions p,, p,, respectively. 
Analytic expressions for obtaining upper bounds of the necessary size of operation 
computer memory, in dependence of the implemented method of allocating inquiries, 
can be obtained on the basis of the corresponding models. Suppose that in the 
automatic control system under investigation we have implemented batch allocation 
of inquiries on distinguishing memory. For the computer of this automatic control 
system we can obtain an upper bound for the necessary size of operation memory by 
using model 5.6; in fact, to each state of operation computer memory corresponds 
a unique batch of inquiries on distinguishing memory. Using this fact the required 
quantity V is defined by the expression 


(mms Sok + (Km -9r +h] 


for ky >-+- > k:, where k;,k; are the parts of the partitions p;, p,, respectively. 
For the computer of an automatic control system in which individual allocation 
of inquiries on distinguishing memory is implemented, we use model 5.7 to estimate 


~ (pePeB(Q) 
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the necessary size of operation memory. Taking into account the same peculiarities 
of the initial data, represented by the set B(Q), the calculation of a bound is a 
particular case of that of the expression in the model 5.7, and is defined by 


t r 
Bs + (a,b -1)r+Sa), 


Vi (3: 
(pedJEB(Q) 
where k;,k; are the parts of the partitions p,, pz, respectively. 

Clearly, V < V’, if V and V’ are computed for one and the same set B(Q). 
This relation follows from the rule of constructing the set W(Q), which, in turn, 
determines the collection of elements B(Q). However, if distinct subsets of B(Q) 
are taken to compute V and V’, then this relation may become different. 

It is not difficult to note that the amount of calculations in estimating the size of 
operation computer memory using the expressions given above is mainly determined 
by the amount of terminals served by the automatic control system, their functional 
meaning, and the presence of data regarding software implementing jobs or functions 
of the automatic control system. The rule for constructing the set B(Q) allows us 
to determine the amount of calculations necessary to estimate the size of operation 
computer memory at the present design stage of the automatic control system. The 
amount of operations to be performed in the given case is defined by the quantity 
|W(Q)| = |B(Q)| = 2%- TT, |Q;|, where the operation is taken to be the calculation 
of the expression defining V or V’ for a single element (p,, p:) € B(Q). When data 
regarding the software structure implementing jobs of the suo HG control ven 
emerges, the amount of calculations increases sharply. Let g = (kW a ki? ) bea 
partition, interpreted as the sizes and number of programs volving the necessary 
array data implementing the ith job in the automatic control system. Using model 
5.3, in this case to each job of the automatic control system will correspond not one 
number (a bound on the necessary size of memory for implementing the job), but 
t; numbers (the parts of the partition qg). Assume that under execution of a job by 
the automatic control system the computer memory contains only one sequence of 
programs implementing this job, i.e. that after execution the programs empty the 
operation memory occupied. Then, in accordance with the models 5.3, 5.5-5.7, the 
initial data for the calculation require the execution of 2/ - []/., R(Q;) operations, 
where R(Q;) is the sum of the ranks of the partitions characterizing the software 
structure implementing the jobs intended for initializing the ith terminal of the 
automatic control system. Clearly, []/., R(Q:) > TI/_, |Qi| for t; >> 1. As already 
proved above, in studies using combinatorial models, reduction of the enumeration 
of states of automatic control system elements under investigation is made possible 
by results of extremal combinatorial problems. In this case such a result reads as 
follows. 


THEOREM 5.2. Let p = (k,...,k,) / k, and let Q be the set of all partitions 
q = (ky)jep, where BC 2!"1, ie 
Q= U (ky jen: 
Boalt! 


Let r(p) be the rank of the partition p, (p — q) the partition obtained from p by 
deleting certain parts constituting q, and let |(p — q)| be the sum of the parts of the 
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partition (p—q). Ifk, >--- > k,, then 
max(n(q;r(p — 4) + 1) + [p— gl) =k + (ki — 1)(r — J). 


PROOF. We first prove the upper bound. Substitution of n(q;r(p — q)) gives (in 
accordance with the principle of complete allocation, Theorem 2.2) 


max(n(q; r(p—q)+1)+|p-q) = 


= max (es (3 + (gi — I(r) ro) te | 


qeQ 
q@=(915-+.4t) qi€(p—q) 
q-1l>:->4t 


Since Y4,e(p—q) 9 + sy qj < |p|, 1 <7 <t, the following inequality holds: 


max (ms (3205 (a- Diet ro} 4 » 0) < 


aoe uE(P—4) 
q-12-24 
< pen _ 
<|pl+ max (max(a ~ 1)(r(p) ~ r(q))). 
9= (415-9) 
q-1>--Da1 
But 
ga: Be: a = - 
max (max(a — I(r(p) - r(a))) < (= r=), 
9= (41 5-+459t) 
q-1>-->at 
so that 
ea = = 
pl + max (max(q — D(r(p) -r(@))) < 
9= (41st) 
q-1>->at 


S [ph (=r = ys Rak = 1) 


It remains to prove that the bound is always realized. In fact, for g = (ki) we 
have 


max(n(q;r(p — g) + 1) + [p— al) 2 n((ki); r(p)) + Ip — (ka) = 
=k, + (ky —1l)\(r-1) +k -—ky =k + (ky -— 1)(r—1). 
This proves the theorem. O 


This extremal result substantially reduces the investigated amount of initial data 
when computing upper bounds for the necessary size of operation computer memory. 
In the general case, for each state of the automatic control system, characterized 
in model 5.3 by simultaneous satisfaction of jobs of a complete family, one assumes 
the construction of a set of pairs (q,gi), g € Q(¢), G € Q(q), which characterize 
the states of the system existing before its transition to the state gq € Q. However, 
in transition to actual requirements on memory, characterized by pairs of partitions 
(pr, Pt) C B(Q), the set B(Q) can be made substantially smaller. Let B(q) C B(Q) 
be the set of pairs of partitions (p,, p:) corresponding to (qi,q@). In turn, the latter 
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are given by a bijective correspondence ¢: Q(q) — Q(q), where $(gi) = G = @¢\M%, 
q € Q. Clearly, the parameters of every pair (pr, P) C B(q) satisfy the estimates 


t 
ret =f; do ki + D0 Bin = ki, PSS J 


where k,, are the parts of the partition p,, 1 < 7 <r, kj, are the parts of the 
partition , 1 <m <t and k;, are the parts of the rank-f partition which is the list 
of required operation memory for implementing the jobs in g € Q. Then it is easy 
to prove that, in accordance with the results of Theorem 5.2: 


m 7 2 f 
(mas, Sh + (Rig — 1) + EA] = dhe aE (max ky — 1) fae 


Thus, in the model 5.3, for estimating the size of operation computer memory 
necessary for simultaneously satisfying the jobs in g € Q, for each q € Q, we need 
not study all 2/ states preceding the transition of the system to state q. It suffices 
to consider one such state, characterized by a pair of partitions (p,, p;) such that 


Pp, = (ki,), Pr = Rigen RE); 


ki, = maxy;<ics ki, r = f — 1. Hence, to compute an upper bound for the necessary 
size of operation computer memory, as initial data we may take a set of partitions 
P(Q) each element p(q) = (k,,...,ky,) of which is the list of memory sizes required 
for implementing the jobs from the complete family g € Q corresponding to it. Using 
Theorem 5.2, to determine V’ it suffices to compute 


f 
V(Q) = mera) (3: k; + (max k; = 1) 7 F 
q 


The quantity V(Q) can be used as upper bound for the necessary size of operation 
computer memory in an automatic control system under both batch and individual 
methods of satisfying inquiries on memory. It is easy to prove that here the required 
amount of calculations, as in the previous cases, is determined by the set of initial 
data. In fact, 


max 
(Pr,Pt)C B(q) 


f f 
IP(Q)| = T]1Qil « 2” - TT lai 

i=1 i=1 
We note that the parts of each partition p(q) can be chosen, in accordance with 
the rule given in model 5.3, from one and the same list of values (k),... , k;), where k; 
corresponds to the necessary size of continuous address space of operation memory 
which is needed for implementing the ith job of the system and / is the number of 
possible data that can be implemented by the automatic control system. Using the 
fact that in finding V(Q) the quantity r = const, enumeration of the elements of 
P(Q) in the determination of V(Q) is completely excluded by the following lemma. 


LEMMA 5.1. Let A(ki,---,Ki,) = Dhar ki; + (maxrcicr bi; — 1), (hiys.--5ki,) € 
R, the set of all r-pairings of elements of the set N = {ki,,...,ki,}, ki > 0, ky = 
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--» >k,l>r. Then 


PR a ee ~ di +h hy 1) ir 


PROOF. Using the rule for constructing R, we determine the quantity 


max max k;. )} < max k; = ky, ki EN. 
(ki, 1. ki, JER l<i<r ? 1<i<l 


But max(g,, ,....4:,)ER 2ja1 Ki; S Vina kj, i.e. the maximum is determined by the sum 
of maximal elements of N, and in fact by the elements (k),...,k,). By definition, 
the elements of R are all possible r-pairings of elements of N, i.e. (k,,...,k,) € R, 


as required. O 


The result of Lemma 5.1 makes it possible to assume that the quantity V(Q) is 
determined by a partition p’(q) € P(Q) with maximal parts in comparison to other 
elements of P(Q). We will call p'(q) the extremal size of the software structure for the 
concrete distribution of functions over terminals of the automatic control system. 
An extremal section of a software structure has important value in studying and 
optimizing parameters characterizing the process of computer memory allocation. 
To find p'(q) without enumeration of the elements of P(Q) makes it possible to give 
a simple construction rule for P(Q), which uses the ordered list of a priori data 
regarding the sizes of inquiries on memory in the automatic control system. To sum 
up th e results obtained, we determine the order of actions in estimating from above 
the necessary size of operation computer memory in an automatic control system. 


6. The order of calculating a bound for the necessary size of operation 
computer memory in an automatic control system 


First it is necessary to analyze the existing data regarding the software structure 
implementing jobs in the automatic control system, and to tie them to data regarding 
the functional meaning of terminals of the automatic control system. To this end 
we represent data regarding the necessary sizes of memory needed for implementing 
jobs in the automatic control system by a list of values N = (k),...,k,). Depending 
on the stage of design under consideration, the elements k; (1 < 2 < l) of this 
list may be data regarding oriented memory sizes for implementing jobs, or data 
regarding the sizes of all programs and their arrays which make up the software of 
the automatic control system. Let N be the list of values of memory sizes needed 
for implementing the programs making up the software of the automatic control 
system. We number the elements of this list from 1 to /, and order them such that 
ky oe OA ky. 

To tie the initial data regarding the software structure to the functional meaning 
of terminals of the system, we must construct the set Q; (1 <i < f) whose elements 
are jobs intended to be initialized by the ith terminal. From N and Q; we then 
obtain, using the map ¢: N — Q; such that 


oz) = (hi: == (hy), ky EN 2EQ} (I Si<f, 


a list Qi(V) of memory sizes required for implementing the jobs making up the set 
Q;:. The elements of Q;(N), as well as those of N, are ordered by decreasing value. 
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Using the lists Q;(N) (1 < i < f) we determine the extremal section of the 
software structure of the automatic control system, p'(q), using the following rule: 


p'(q) = (ki; + ki; € Qm(N), J= ming elm)), leom<f, 


where f is the amount of terminals to be served by the automatic control system, 
and Lm is the set of indices of the elements in Qm(N), Lm C {1,...,U}. 

The calculation of an upper bound for the necessary size of operation computer 
memory in the automatic control system is, in accordance with Lemma 5.1, con- 
ducted on the elements of p'(q), in correspondence with the following expression: 


f 
V(Q) = ki; zg (ki, = lf, 


where ki, €p/(q), 1 <i<l. 

Thus, the successive solution of easy extremal combinatorial problems concerning 
packability in the set of number partitions, interpreted as the possible states of the 
system under investigation, completely excludes the enumeration of all its elements 
in finding the value of a functional defined on the parameters of the elements of this 
set. 

The rule for finding an extremal section of the software structure of the auto- 
matic control system has polynomial computational complexity. This guarantees 
a relatively fast solution of this problem on a computer, for software structures 
of practically any complexity. The selection of a sector of operation memory in 
correspondence with V(Q) is guaranteed by the use of the simplest method for 
controlling memory allocation, the software for whose realization are given by that 
unique first-fit memory allocation algorithm. The quantity V(Q) is an upper 
bound for the necessary size of computer memory, therefore, of course, it exceeds 
the real requirements on memory in implementing some software structure and col- 
lection of programming tools of the automatic control system. We can determine 
the efficiency of applying some extremal result regarding the packability of number 
partitions to compute an upper bound of the necessary size of operation computer 
memory. To this end, next to the comparison of the extremal results obtained with 
each other, it is useful to have some ‘absolute’ value of the parameters under inves- 
tigation. The method for comparing various extremal results regarding packability 
of number partitions which is most simple to state is that of comparison by the 
results of solving the packability problem for a pair of partitions definitely known 
to be packable. The simplest formulation of the problem in this case is to verify 
‘self-packability’ 


(ky,--., ke) & (ki, .-., ke). 


This formulation shows, for each extremal result giving a bound for the necessary 
size of free memory, how much memory will guarantee the simultaneous allocation 
of inquiries of sizes (k,,...,k) in fragmented address space of free memory with 
sizes (k,,...,k:) of free sectors. Using Dirichlet’s principle (within the model 5.7) 
for determining the necessary size of operation computer memory, in accordance 
with the approach chosen we obtain the bound tk —¢ + 1. 
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In the models 5.4—5.7 the method of complete allocation leads to the bound 


Chien het) = max (34 + (ki — 1)(t - »| 


for the necessary size of operation computer memory. By the upper bound in (2.27) 
the least. possible bound for the necessary size of operation computer memory is 
k- 1.5819, since 


kt k 
m(k,t,t) < i — (t—1) ee k- 1.5819. 


This result plainly shows to what amount the principle of complete allocation is 
more efficient than Dirichlet’s principle. 

The use of Theorem 2.3 to determine the necessary size of operation computer 
memory gives the following bounds. 


PROPOSITION 5.3. Let 


M(k,t)= max __n(ki,..., ke; ko,..- , ky) 


Then 
k + (t—1) ([E] —1) < M(k,t) < 2k- [A] -—t+1. 
PrROooF. The following equation holds: 
n(ky,. a3 kt; Ke, - faye) =k+ max (ki = 1)(2 = 1); 


In fact, 


RK aah kon ki = max (34 £5. ci kiki — »| = 


<i<t 122 
ras (So % A+ ==) = pth N6=D. 
77 Vel j=it+1 


The lower bound is realized by the most uniform partition: (]k/t[,...,]k/t[) F k, 
hence the inequality k; > [k/t] holds for the maximizing partition (k,,...,k:) Fk 
and its maximizing index 7. 

We now prove the upper bound by the method of contradiction. Assume that 
(¢-—1l)kij -i+1>k-— [k/t] t+ 1. Then 

(i-lk-it+1>k—[§]-e+1 Sh +---+h&—-[#]-#+1> 

2 t= kek kick Baa ees he (é] si LS 
> (i- Iki thy +t —i— [4] -t +1 = - 1k t+ he — [£] 41, 


or ki < [k/t], contradicting the previous remark. O 
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The given system of bounds, obtained by using results of solving extremal com- 
binatorial problems, plainly demonstrate the fact that with increasing a priori in- 
formation regarding the functioning of the elements under investigation the results 
become substantially more refined, i.e. the efficiency of applying combinatorial mod- 
els in these investigations increases. To stress this it suffices to compare the bounds 
given above. 

It is easy to prove that if the size of operation computer memory is chosen in 
agreement with models 5.4—5.6, then in the process of functioning of the automatic 
control system the efficiency of using memory does not exceed the value 


Dye ki, 
wi. ki, + (ki, — US 


Consequently, for a software structure of the automatic control system in which 
the elements of an extremal section guarantee that a condition 


- 100%. 


f 
So ki, > (ki, _ 1)f 
j=l 

holds, the efficiency of using operation memory will exceed 50%. It is clear from 
the last inequality that increase of the actual required size of memory, defined by 
the very quantity (ki_, — 1) f, uses (in accordance with the definitions in the models 
5.5-5.7) maximal influence of external fragmentation on the process of computer 
memory allocation. The appearance of external fragmentation is given here under 
the assumption that having f occupied sectors in memory leads to the formation 
of an (f + 1)st free sector, while the sizes of these free sectors are one less than 
the maximal possible size of an inquiry on memory which may arise during the 
functioning of the system. This assumption on the influence of external fragmenta- 
tion of memory reflects to a sufficient extent the actual process of functioning of a 
computer’s memory. The ‘50-percent rule’ proved in [41] testifies of this. However, 
this rule characterizes a steady-state regime of functioning of memory, or, according 
to the definitions in [41], an equilibrium state, in which there are on the average 
n occupied sectors in the system. Moreover, the 50-percent rule uses the notion 
of probability, which makes it impossible to use it as a characteristic of the phe- 
nomenon of fragmentation in calculating the size of operation computer memory 
which would guarantee satisfaction of every inquiry on memory without using the 
means of reorganization and re-allocation. 

The results given show that the required size of computer memory of an automatic 
control system, necessary for implementing the investigated software structure, is 
mainly determined by the elements of an extremal sector of the structure. Using 
the random nature of the process of functioning of memory, we suppose that. pre- 
cisely these elements determine the maximal influence of external fragmentation. 
Consequently, by investigating the whole set of states of address memory space, 
being determined by the various situations of simultaneous execution of elements of 
an extremal sector of the software structure, we can substantially refine the bound 
V(Q). These investigations can be conducted by using both the method of imita- 
tion modeling, as well as the method of extremal combinatorial analysis. The most 
important is that these investigations need not be conducted on the whole software 
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structure of the automatic control system, but only on the elements of an extremal 
sector of it. 

The quantity V(Q) and the extremal sector of the software structure are of great 
importance for work scheduling during the design and extension of functions of the 
automatic control system. In fact, for the scheduling of the computational process 
here considered the quantity V(Q) does not depend on the amount of programs by 
which some job of the automatic control system is executed. Therefore the choice of 
operation memory size in accordance with V(Q) makes it possible to unrestrictedly 
augment functions of the automatic control system without increasing the size of 
operation computer memory. This guarantees that the following single condition is 
fulfilled when extending functions of the automatic control system: 


the software realizing additionally introduced functions need not bring 
changes in the extremal sector of the software structure of the auto- 
matic control system. 


An extremal sector of the software structure can be found already at an early stage 
in the design of the automatic control system, using the mathematical apparatus 
expounded here. Consequently, data regarding the extremal sector can be noted 
in documentation on the design and implementation of software. Taking notice 
of these data as restrictions on the admissible sizes of program and informational 
modules allows one to fully exclude, in the creation of the system, the solution 
of problems related with selecting the size of operation computer memory of the 
automatic control system. 

Despite the fact that the quantities V, V’, and V(Q) are upper bounds for the nec- 
essary size of computer memory (which are even not applicable in realizing certain 
automatic control systems), they express an analytical dependence of such charac- 
teristics of automatic control systems as peculiarities of the software structure, the 
amount of terminals to be served by the automatic control system, the amount of 
functions to be implemented by it, and the functional meaning of the terminals. 
Consequently, the models 5.3-5.7 and the solutions of the extremal combinatorial 
problems obtained may be regarded as a mathematical apparatus for studying and 
optimizing the process of control of computer memory, the software structure, and 
the distribution of functions between terminals of the system. 

Moreover, the considered models of restrictions on controlling the computational 
process, prescribing the finishing of a program by emptying the memory occupied by 
it, in no way restrict the scope of the investigations conducted using this apparatus. 
When mapping the set of initial data (model 5.7) onto actual data concerning the 
software structure using the formal apparatus proposed, one may use a number 
of additional models, making it possible to study the working of programs with 
array data, the presence of re-entering and nonre-entering programs in the software 
structure of automatic control systems, and the implementation of other methods 
of controlling the course of the computational process. Consequently we have the 
possibility to study, by these models, all above given peculiarities of functioning of 
automatic control systems. 

For example, by correcting the software structure with regard to the sizes of pro- 
grams and arrays, and also by shifting certain programs into the stock of residing 
programs, one obtains the possibility of optimizing the structure and of controlling 
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the computational process with regard to resource demands on operation computer 
memory; one may study the influence of re-entering of a program, for a given al- 
location of functions between the terminals of the automatic control system, on 
the necessary size of operation memory, etc. It is here important to note that the 
proposed approach to investigations allows us to solve two classes of combinatorial 
problems: problems concerning the determination of the presence of an admissible 
solution (on the packability of partitions), and problems of constructing theoretically 
substantiated algorithms for realizing this solution in polynomial time (an algorithm 
for packing partitions). 

In conclusion we note that the expense of processor time on the organization of 
control of computer memory allocation in an automatic control system with highly 
dynamical incoming inquiries during functioning, constitutes more than 1/3rd_ of 
the overall time for solving functional problems. The choice of the size of operation 
memory using the proposed approach makes it possible to implement the easiest 
method for controlling allocation in the system. This reduces the inproductive 
expense of processor time on the organization of control of allocation of operation or 
additional computer memory of the automatic control system. The software realizing 
this organization of control of operation memory is almost exclusively the program 
of the first-fit algorithm. By excluding from the functions of computer software 
of the automatic control system the tools for controlling the operation memory, 
and by implementing the first-fit algorithm, one can substantially increase the 
productivity of a system of this class. 


7. Deciphering of passwords 


In computers and computer networks one uses passwords, i.e. words consisting 
of the letters, ciphers and signs available on a keyboard. This is done with the 
purpose of restricting user access to certain programming means. For example, 
before entering the American Mathematical Society database one must choose the 
passwords e-math. 

Assume that we do not know a password necessary for us to enter a program or 
database. How to guess this unknown password? 

One of the methods is given in problem 3.27. However, it turns out that the 
weighing problem can also be presented differently. Each sign in a password must be 
taken from a certain set of values (letters, ciphers and signs; e.g. 40 different ones), so 
we may suppose that each sign is taken from the 40 distinct numbers 1, 2,..., 40. In 
fact, letters, ciphers and signs are coded in the computer (in the binary system) just 
like this. So, the question of determining a password is equivalent to determining 
a system of independent loads for parallel weighing with as many weights as the 
word is long. Therefore the most economic system of weights (i.e. numbers) for 
deciphering a password can be determined from the results of Chapter 2 concerning 
single-pan weighing on parallel weights. That is, the inequality 


t 
SO Sil higecs kar) 
i=1 


implies that the fastest sequence for deciphering two-letter passwords is determined 
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by the system 


a 
ki <1 + Dour i=1,2,... 
j=1 


and has the form 1,1, 2,3, 4,6,9, 14, 21,31,.... 
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